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PCT/DS9S/04228 
PROTEIN TYROSINE KINASE AGONIST ANTIBODIES 

BACKGROUND OF THE INVENTION 
FIELD OF THE INVENTION 

The present invention relates to novel protein tyrosine kinase (pTK) 
5 genes, the proteins encoded by these genes, RNA nucleic acid sequences 
which hybridize to the genes, antibodies specific for the encoded proteins, 
chimeras of the proteins and methods of use therefor. 

In particular, this application relates to agonist antibodies which 
are able to activate the tyrosine kinase domain of the receptor pTKs 
10 disclosed herein and pTK- immunoglobulin chimeras. 

DESCRIPTION OF RELATED APT 

Transduction of signals that regulate cell growth and differentiation 
is regulated in part by phosphorylation of various cellular proteins. 
Protein tyrosine kinases are enzymes that catalyze this process. Moreover, 

15 many act as growth factor receptors. The c-kit subgroup of receptor 
tyrosine kinases catalyze the phosphorylation of exogenous substrates, as 
well as tyrosine residues within their own polypeptide chains (Ullrich et 
al.i CeU £1:203 [1990]). Members of the c-kit subgroup include FLT/FLK 
(Fetal Liver Kinase) , PGF (Fibroblast Growth Factor Receptor) and NGF 

20 (Nerve Growth Factor Receptor) . 

The EPH tyrosine kinase subfamily, Eph, Elk, Eck, Eek, Hek, Hek2, 
Sek, Ehk-1, Ehk-2, Cek-4 to -10, Tyro 1, 4, 5 and 6, appears to be the 
largest subfamily of transmembrane tyrosine kinases (Hirai et al . , Science 
2M:1717-1720 [1987]; Letwin et al., Oncogene 3:62l-fi7fl [1988]; Lhotak et 

25 aJ. ( Mol. Cell, pig*. 11:7071-7079 [1993]; Lindberg et al., Mol. Cell. 
£102^112:6316-6324 [1990]; Bohme et al. ( QnsS3Sn& fi: 2857-2862 [1993]; and 
Wicks et al., Proc. Natl. Acad. Sci. USA. ag-isii -ict* [1992]; Pasquale et 
*1« Cell Regulation. 2:523-534 [1991]; Sajjadi et al., New Biol. 2:769-778 
[1991]; Wicks et al., Proc. Natl. Acad. Sci. USA. £2:1611-1615 [1992]; 

30 Lhotak et al., Mol, Cell, Bio, 11:2496-2502 [1991]; Gilardi-Hebenstreit et 
al., Oncogene 2:2499-2506 [1992]; Lai et al., Neuron £:691-704 [1991]; 
Sajjadi et al., oncogene &:1807-1813 [1993]; and Maisonpierre et al., 
Qncpgene £:3277-3288 [1993]). 

Additional pTKs and agonist antibodies thereto are needed in order 

35 to further study growth and differentiation of cells, for use as 
therapeutic agents and for diagnostic purposes. Accordingly, it is an 
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object of the present invention to provide novel pTK genes, the proteins 
encoded thereby, antibodies specific for the encoded proteins, chimeras of 
the proteins and methods of use thereof. 



SUMMARY OF THE INVENTION 

5 The genes isolated as described herein are referred to, collectively, 

as "protein tyrosine kinase genes" or "pTK genes". The nucleic acid 
sequences of some of these genes, isolated as discussed herein, show 
significant homology with previously identified protein tyrosine kinases 
containing extracellular domains, which function as growth factor receptors 

10 (e.g. , pTKs of the c-kit subgroup) . Some of the pTK genes have been shown 
to be present in both megakaryocytic and lymphocytic cells. 

In particular, fourteen pTK genes have been identified. Two pTK 
genes, referred to as SAL-SI and SAL-D4 were identified in megakaryocytic 
cells. SAL-D4 is related to the CSK family of intracellular pTKs and SAL-SI 

15 is related to the FGP receptor family of pTKs. Five pTK genes, referred to 
as LpTKs, were identified in lymphocytic cells and have been shown to be 
present in megakaryocytes as well. One pTK gene, referred to as HpTKS, was 
identified in human hepatoma cells. Six pTK genes, referred to as bpTK 
genes, were found in human brain tissue. 

20 The pTK genes, which are the subject of the present invention, were 

generally identified using two sets of degenerative oligonucleotide 
primers: a first set which amplifies all pTK DNA segments (SEQ ID NOS: 1- 
2) , and a second set which amplifies highly conserved sequences present in 
the catalytic domain of the c-kit subgroup of pTKs (SEQ ID NOS: 3-4) . The 

25 pTK genes identified in this manner are described below. 

SAL-SI is expressed in several megakaryocytic cell lines, but not in 
erythroid cell lines. The nucleotide sequence of part of SAL-Sl was 
obtained, revealing a sequence containing 160 base pairs (SEQ ID NO: 5) . 
This isolated DNA fragment encoded an amino acid sequence (SEQ ID NO: 6) 

30 which exhibited significant sequence homology with known protein tyrosine 
kinases of the FLT/FLK family. The deduced amino acid sequence of SAL- SI 
(SEQ ID NO: 32) contains 1298 residues. 

SAL-D4, also expressed in megakaryocytic cells, is a DNA fragment 
containing the nucleotide sequence of 147 base pairs. (SEQ ID NO: 7) . This 

35 isolated DNA fragment encoded an amino acid sequence (SEQ ID NO: 8) which 
exhibited significant sequence homology with known protein tyrosine kinases 
of the CSK intracellular pTK family. 
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The LpTKs, including LpTK 2, LpTK 3, LpTK 4, LpTK 13 and 
LpTK 25, are expressed in lymphocytic cells, as well as megakaryocytic 
cells. The nucleotide sequence (151 base pairs) of the LpTK 3 gene was 
obtained (SEQ ID NO: 11). The nucleotide sequences of the LpTK 2, LpTK 4, 
5 and LpTK 13 genes contained 149 base pairs (SEQ ID NO: 9), 137 base pairs 
(SEQ ID NO: 13), and 211 base pairs (SEQ ID NO: 15) respectively. LpTK 25 
has a nucleotide sequence of 3120 b.p. {SEQ ID NO: 22) . A full length gene 
sequence has been obtained for LpTK 2 (SEQ ID NO: 19) which contains 7607 
b.p. Additional sequencing of LpTK 4 revealed a sequence of 404 b.p. (SEQ 
10 ID NO: 21) . 

The HpTK5 gene, expressed in human hepatoma cells, has a nucleotide 
sequence of 3969 b.p. (SEQ ID NO: 23) . 

Nucleotide sequences of the bpTKs, including bpTK 1, bpTK 2, bpTK 3, 
bpTK 4, bpTK 5 and bpTK 7, are expressed in human brain tissue and encode 
15 proteins having the amino acid sequences of SEQ ID NOS: 25-29 and 34 
respectively! 

Thus, the present invention includes DNA isolated from a human 
megakaryocytic cell line, which hybridizes to DNA encoding an amino acid 
sequence which is highly conserved in the catalytic domain of protein 

20 tyrosine kinases of the c-kit subgroup. 

The present invention also includes the proteins encoded by the pTK 
genes identified as described herein, which exhibit significant sequence 
homology with members of the c-kit subgroup of pTKs as well as the proteins 
encoded by HpTK5 and the bpTKs. The present invention also includes SAL- 

25 SI, SAL-D4, LpTK, HpTKS and bpTK homologues or equivalents (i.e., proteins 
which have amino acid sequences substantially similar, but not identical, 
to that of SAL-SI, SAL-D4, the LpTKs, HpTK5 and the bpTKs, which exhibit 
tyrosine kinase activity) . This invention further includes peptides (SAL- 
SI, SAL-D4 , LpTK, HpTK5 and bpTK fragments) which retain tyrosine kinase 

30 activity, yet are less than the entire SAL-SI, SAL-D4, LpTK, HpTKS and bpTK 
sequences; and uses for the SAL-SI, SAL-D4 , the LpTK, HpTK and the bpTK 
nucleic acid sequences and SAL-SI, SAL-D4, LpTK, HpTK and bpTK equivalents. 

The present invention further includes nucleic acid sequences which 
hybridize with DNA or RNA encoding the proteins described herein, which 

35 exhibit significant sequence homology with the FLT/FLK, FGF receptor or NGF 
receptor family of protein tyrosine kinases contained within the c-kit 
subgroup. Such nucleic acid sequences are useful as probes to identify pTK 
genes in other vertebrates, particularly mammals, and in other cell types. 
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They can also be used as anti-sense oligonucleotides to inhibit protein 
tyrosine kinase activity, both in vitro and in vivo. 

The SAL-Sl, SAL-D4, LpTK, HpTK and bpTK tyrosine kinases of the 
present invention can be used as target proteins in conjunction with the 
5 development of drugs and therapeutics to modulate cell growth, • 
differentiation and other metabolic functions. The SAL-SI, SAL-D4, LpTK, ^ 
HpTK or bpTK proteins can be used as agonists or antagonists to other 4 
tyrosine kinases. The pTKs can also be instrumental in the modulation of 
megakaryocyte and/or platelet adhesion interactions. 

10 In addition, the SAL-SI, SAL-D4, LpTK, HpTK and bpTK tyrosine kinases 

can be used in screening assays to detect cellular growth and/or 
differentiation factors. Using standard laboratory techniques, the ligands 
of the pTKs of the present invention can be identified. In particular, the 
invention provides chimeric pTK- immunoglobulin fusion proteins which are 

15 useful for isolating ligands to the pTKs disclosed herein. The chimeric 
proteins are also useful for diagnostic assays designed to detect these 
ligands present endogenously, within cells, as well as exogenously, in 
extra-cellular fluids. Assays, using the chimeric proteins, can also be 
designed as diagnostic aids to detect these ligands in body fluids such as 

20 blood and urine. 

In another aspect, the invention provides antibodies specific for 
SAL-SI, SAL-D4, the LpTKs, HpTK5 and the bpTKs, which are optionally 
agonists for their respective pTK (where the pTK is a receptor) . The 
invention also concerns a hybridoma cell line and an isolated nucleic acid 

25 encoding a monoclonal antibody as herein defined. 

Also, the invention pertains to a method for activating a pTK as 
herein disclosed, comprising reacting the pTK with an agonist antibody 
thereto. In a different aspect, the invention concerns a method for 
enhancing cell growth and/or differentiation comprising administering to 

30 a human patient in need of such treatment a physiologically effective 
amount of an agonist antibody which activates a pTK as herein disclosed. 

In a still further aspect, the invention concerns a method for 
detecting a pTK by contacting a source suspected of containing the pTK with ^ 
a detectably labeled monoclonal antibody which reacts immunologically with 
35 the pTK, and determining whether the antibody binds to the source. 
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BRIEF DESCRIPTION OF TOP. royTHffF 
Figures 1A and IB depict the nucleotide sequence of SAL- SI (SEQ ID 
NO: 5) and its deduced amino acid sequence (SEQ ID NO: 6) . 

Figures 2A and 2B depict the nucleotide sequence of SAL-D4 (SEQ ID 

# 5 NO: 7) and its deduced amino acid sequence (SEQ ID NO: 8) . 

# Figure 3A depicts the nucleotide sequence of LpTK 2 (SEQ ID NO: 9) 

# and its deduced amino acid sequence (SEQ ID NO: 10). 

Figure 3B depicts the nucleotide sequence of LpTK 3 (SEQ ID NO: 11) 
and its deduced amino acid sequence (SEQ ID NO: 12) . 
10 Figure 3C depicts the nucleotide sequence of LpTK 4 (SEQ ID NO: 13) 

and its deduced amino acid sequence (SEQ ID NO: 14). 

Figure 3D depicts the nucleotide sequence of LpTK 13 (SEQ ID NO: 15) 
and its deduced amino acid sequence (SEQ ID NO: 16) . 

Figures 4A-4I depict the nucleotide sequence (SEQ ID NO: 17) of SAL- 
15 SI and its deduced amino acid sequence (SEQ ID NO: 18) . 

Figures 5A-5K depict the full length nucleotide sequence (SEQ ID NO: 
19) of LpTK2 and its deduced amino acid sequence (SEQ ID NO: 20) . 

Figure 6 depicts the partial nucleotide sequence (SEQ ID NO: 21) for 

LpTK4. 

20 Figures 7A-7C depict the full length nucleotide sequence (SEQ ID NO: 

22) for LpTK25. 

Figures 8A-8I depict the full length nucleotide sequence (SEQ ID NO: 

23) and the deduced amino acid sequence of HpTKS (SEQ ID NO: 24) . 

Figure 9 depicts the amino acid sequence (SEQ ID NO: 25) of bpTKl. 
25 Figure 10 depicts the amino acid sequence (SEQ ID NO: 26) of bpTK2. 

Figure 11 depicts the amino acid sequence (SEQ ID NO: 27) of bpTK3. 

Figure 12 depicts the amino acid sequence (SEQ ID NO: 28) of bpTK4. 

Figure 13 depicts the amino acid sequence (SEQ ID NO: 29) of bpTK5. 

Figure 14 depicts the amino acid sequence (SEQ ID NO: 30) of bpTK7. 
30 Figures 15A-15F depict the full-length nucleotide sequence of SAL-SI 

(SEQ ID NO: 31) and its deduced amino acid sequence (SEQ ID NO: 32) . 

Figures 16A-16H depict the full-length nucleotide sequence of bpTK7 
(SEQ ID NO: 33) and its deduced amino acid sequence (SEQ ID NO: 34) . 

DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS 

35 Novel protein tyrosine kinase genes have been identified, their 

nucleic acid sequences determined, and the amino acid sequences of the 
encoded proteins deduced. The genes isolated as described herein are 

-5- 
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referred to, collectively, as "protein tyrosine kinase genes" or "pTK 
genes " . 

To facilitate the isolation and identification of these novel pTKs, 
two sets of DNA probes were used, as described in Example 1. The first set 
5 generally consisted of two degenerative oligonucleotide sequences, pTK 1 ? 
(SEQ ID NO: 1) and pTK 2 (SEQ ID NO: 2) (Matthews, Cell ££:1143 [1991]; and ' 
Wilks, Proc. Natl. Acad. Sci. USA 86:1603 [1989]). These sequences were % 
used as primers in a polymerase chain reaction to amplify tyrosine kinase 
DNA segments (Mullis, et al., Cold Soring Harbor Symp. Advan. Biol. £1:2(53 
10 [1986]). 

The second set generally consisted of two oligonucleotide sequences, 
pTK 3 (SEQ ID NO: 3) and pTKKW (SEQ ID NO: 4) designed to amplify the 
nucleic acid sequence which encodes the highly conserved regions of the 
catalytic domains of the c-kit family of protein tyrosine kinases. These 

15 sequences were used as primers in the polymerase chain reaction (PCR) in 
a second round of DNA amplification. Using this two-step amplification 
procedure, DNA fragments which hybridized to these pTK primers were 
identified, isolated and subsequently sequenced. 

In particular, fourteen pTK genes have been identified. Two pTK 

20 genes, referred to as SAL-SI and SAL-D4, were identified in several 
megakaryocytic cell lines, including CMK 11-5, DAMI, UT-7 and UT-7 grown 
in erythropoietin, but not in the erythroid cell lines HEL, PMA stimulated 
HEL cells, or K562. Five pTK genes, referred to as LpTKs, were identified 
in lymphocytic, as well as in megakaryocytic cells. One pTK gene, referred 

25 to as HpTK5, was identified in human hepatoma cells, and six genes, 
referred to as bpTKs, were identified in human brain tissue. 

SAL-SI (SEQ ID NOS: 6, 18 and 32) encoded by the nucleic acid 
sequence of SEQ ID NOS: 5, 17 and 31 exhibits significant homology with the 
FLT/FLR family of pTKs. SAL-SI has a signal peptide (i.e., amino acid 

30 residues 1 to 24 of Figure 15); extracellular domain (i.e., amino acid 
residues 25 to 775 of Figure 15); transmembrane domain (i.e., amino acid 
residues 776 to 800 of Figure 15) and a cytoplasmic tyrosine kinase domain 
(i.e., amino acid residues 801 to 1298 of Figure 15) . SAL-D4 (SEQ ID NO: 
8) encoded by SEQ ID NO: 7 is related to the CSK family of intracellular % 

35 pTKs. The LpTKs, LpTK 2 (SEQ ID NOS: 10 and 20) encoded by SEQ ID NOS: 9 
and 19; LpTK 3 (SEQ ID NO: 12) encoded by SEQ ID NO: 11; LpTK4 (SEQ ID NO: 
14) encoded by SEQ ID NOS: 13 and 21; LpTK13 (SEQ ID NO: 16) encoded by SEQ 
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ID NO: 15; and LpTK25 encoded by SEQ ID NO: 22, also exhibit sequence 
homology with known protein tyrosine kinases. 

HpTK5 (SEQ ID NO: 24) encoded by SEQ ID NO: 23 and the bpTKs 1, 2, 
3, 4, 5 and 7 (SEQ ID NOS: 25-29 and 34 respectively), similarly exhibit 
sequence homology with known protein tyrosine kinases. BpTK7 encodes a 
receptor pTK with a signal peptide (i.e., amino acid residues 1-19 of 
Figure 16); extracellular domain (i.e., amino acid residues 20-547 of 
Figure 16); and transmembrane domain (i.e., amino acid residues 548-570 of 
Figure 16) . The remaining sequence comprises the intracellular tyrosine 
kinase domain. 

Thus, as described above, DNA molecules which hybridize with DNA 
encoding amino acid sequences present in the catalytic domain of a protein 
tyrosine kinase of the c-kit subgroup of protein kinases have been isolated 
and sequenced. These isolated DNA sequences, collectively referred to as 
"pTK genes", (and their deduced amino acid sequences) have been shown to 
exhibit significant sequence homology with known members of pTK families. 

Once isolated, these DNA fragments can be amplified using known 
standard techniques such as PCR. These amplified fragments can then be 
cloned into appropriate cloning vectors and their DNA sequences determined. 

These DNA sequences can be excised from the cloning vectors, labeled 
with a radiolabeled nucleotide such as 32 P and used to screen appropriate 
cDNA libraries to obtain the full-length cDNA clone. 

The pTK genes as described above have been isolated from the source 
in which they occur naturally, e.g., megakaryocytic and lymphocytic cells. 
The present invention is intended to include pTK genes produced using 
genetic engineering techniques, such as recombinant technology, as well as 
pTK genes that are synthesized chemically. 

The deduced amino acid sequences of the pTK genes include amino acid 
sequences which encode peptides exhibiting significant homology with the 
catalytic domain of protein tyrosine kinases of the c-kit subgroup of 
tyrosine kinases. These proteins, encoded by the pTK genes, can include 
sequences in which functionally equivalent amino acid residues are 
substituted for residues within the sequence, resulting in a silent change, 
that is a change not detected phenotypically. For example, one or more 
amino acid residues within the sequence can be substituted by another 
amino acid of a similar polarity which acts as a functional equivalent, 
resulting in a silent substitution. 
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In addition, the protein structure can be modified by deletions, 
additions, inversion, insertions or substitutions of one or more amino acid 
residues in the sequence which do not substantially detract from the 
desired functional tyrosine kinase properties of the peptide. 
5 Modified pTKs of the present invention, with tyrosine kinase 

activity, can be made using recombinant DNA techniques, such as excising 
it from a vector containing a cDNA encoding such a protein, or by 
synthesizing DNA encoding the desired protein mechanically and/or 
chemically using known techniques. 

10 An alternate approach to producing the pTKs of the present invention 

is to use peptide synthesis to make a peptide or polypeptide having the 
amino acid sequence of such a protein, depending on the length of the pTK 
desired. The peptides or modified equivalents thereof, can be synthesized 
directly by standard solid or liquid phase chemistries for peptide 

15 synthesis . 

Preferably, the pTKs of the present invention will be produced by 
inserting DNA encoding the proteins into an appropriate vector/host system 
where it will be expressed. The DNA sequences can be obtained from sources 
in which they occur naturally, can be chemically synthesized or can be 
20 produced using standard recombinant technology. 

This invention also pertains to an expression vector comprising a pTK 
gene of the present invention, encoding for a protein which exhibits 
receptor tyrosine kinase activity. 

The pTK genes of the present invention can be used for a number of 
25 diagnostic and therapeutic purposes. For example, the nucleic acid 
sequences of the pTK genes can be used as probes to identify other protein 
tyrosine kinases present in other cell types, including eukaryotic and 
prokaryotic cell types. 

The nucleic acid sequences can also be used to design drugs that 
30 directly inhibit the kinase activity of protein tyrosine kinases, or to 
design peptides that bind to the catalytic domain of tyrosine kinases, thus 
inhibiting their activity. These sequences can also be used to design 
anti-sense nucleotides that can also inhibit, or destroy, tyrosine kinase 
activity. Such inhibition of tyrosine kinase activity would be desirable 
35 in pathological states where decreased cellular proliferation would be 
beneficial, such as leukemias or other malignancies. 

The nucleic acid sequences can also be used to design drugs, peptides 
or anti-sense nucleotides as above, but with enhancing, rather than 
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inhibitory effects, on tyrosine kinases. Such enhanced tyrosine kinase 
activity would result in increasing the phosphorylation of substrates 
(exogenous, as well as endogenous tyrosine residues) . Enhanced effects 
would be desirable in states where increased cellular proliferation would 
5 be beneficial, such as anemias, bleeding disorders and during surgical 
procedures . 

The pTK genes of the present invention can also be used to obtain 
soluble fragments of receptor tyrosine kinases, capable of binding their 
respective ligands. pTK genes encoding soluble tyrosine kinase fragments 

10 can be produced using recombinant DNA techniques or synthetically. In 
either case, the DNA obtained encodes a soluble pTK fragment which lacks 
a substantial portion of the hydrophobic transmembrane region to permit 
solubilization of the fragment. 

These soluble pTK protein fragments can be introduced exogenously to 

15 act as competitors with the endogenous, membrane bound pTK for their 
respective ligands, thus inhibiting tyrosine kinase activity. Alternately, 
a modified soluble pTK protein fragment can be introduced which binds the 
ligand but does not activate kinase activity. 

These soluble pTK protein fragments can also be used in binding 

20 assays to detect ligands such as growth and differentiation factors. Once 
these ligands are identified, they may be altered or modified to inhibit 
or enhance kinase activity. For example, the ligands may be modified or 
attached to substances that are toxic to the cell, such a ricin, thus 
destroying the target cell. The substance may be a super -activating 

25 substance which, after binding to the pTK, may substantially increase the 
kinase activity, or activate other growth factors. 

pTK genes of the present invention would also be useful to develop 
diagnostic tools for in vitro screening assays for ligands such as growth 
factors or differentiation factors that inhibit or enhance kinase activity. 

30 The proteins encoded by the pTK genes can also be used in such assays, or 
as immunogens to produce monoclonal or polyclonal antibodies to be used in 
such assays. 

In one embodiment of the invention, a chimera comprising a fusion of 
the extracellular domain of the pTK (where the pTK is a receptor) and an 
35 immunoglobulin constant domain can be constructed which can be used to 
assay for ligands for the receptor and can be used for the production of 
antibodies against the extracellular domain of the receptor. 
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The expression "extracellular domain" or "ECD" when used herein 
refers to any polypeptide sequence that shares a ligand binding function 
of the extracellular domain of the naturally occurring receptor pTKs 
disclosed herein. Ligand binding function of the extracellular domain 
5 refers to the ability of the polypeptide to bind at least one pTK ligand. 
Accordingly, it is not necessary to include the entire extracellular domain 
since smaller segments are commonly found to be adequate for ligand 
binding. The truncated extracellular domain is generally soluble. The term 
ECD encompasses polypeptide sequences in which the hydrophobic 
10 transmembrane sequence (and, optionally, 1-20 amino acids C-terminal and/or 
N-terminal to the transmembrane domain) of the mature pTK has been deleted. 
Thus, the soluble extracellular domain- containing polypeptide can comprise 
the extracellular domain and the cytoplasmic domain of the pTK. 
Alternatively, in the preferred embodiment, the polypeptide comprises only 
15 the extracellular domain of the pTK. The extracellular and transmembrane 
domains of the pTK can be readily determined by the skilled practitioner 
by aligning the pTK of interest with known pTK amino acid sequences for 
which these domains have been delineated. Alternatively, the hydrophobic 
transmembrane domain can be readily delineated based on a hydrophobicity 
20 plot of the sequence. The extracellular domain is N-terminal to the 
transmembrane domain. 

The term "immunoglobulin" generally refers to polypeptides comprising 
a light or heavy chain usually both disulfide bonded in the native "Y" 
configuration, although other linkage between them, including tetramers or 
25 aggregates thereof, is within the scope hereof. 

Immunoglobulins (Ig) and certain variants thereof are known and many 
have been prepared in recombinant cell culture. For example, see U.S. 
Patent 4,745,055; EP 256,654; Faulkner et al. , Nature 22£:286 [19823; EP 
120,694; EP 125,023; Morrison, J. Immun. 122:793 [1979]; Kdhler et al., 
30 Proc, Nat'l, Acad. gc;i t 22:2197 [1980] ; Raso et al., Cancer Res, 

41:2073 [1981]; Morrison et al. , Ann. Rev. Timmmnl o^q [1984]; Morrison, 
Ssisncja 221:1202 [1985]; Morrison et ml., Proc. Nat'l. Acad, sh qpft 
£1:6851 [1984]; EP 255,694; EP 266,663; and WO 88/03559. Reassorted 
immunoglobulin chains also are known. See for example U.S. patent 
35 4,444,878; WO 88/03565; and EP 68,763 and references cited therein. The 
immunoglobulin moiety in the chimera of the present invention may be 
obtained from IgG x , IgG 2 , IgG 3 , or IgG« subtypes, IgA, IgE, IgD or IgM, but 
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preferably igGi or IgG 3 . Most preferably, the immunoglobulin moiety is the 
Pc portion of IgG-y- 

The terms "chimera comprising a fusion of an extracellular domain of 
a pTK with an immunoglobulin constant domain sequence" or w pTK- 
immunoglobulin chimera" refer to a polypeptide comprising an extracellular 
domain coding amino acid sequence of a pTK conjugated to an immunoglobulin 
constant domain sequence. This definition includes chimeras in monomeric, 
homo- or heteromultimeric, and particularly homo- or heterodimeric, or - 
tetrameric forms. 

A preferred embodiment is the fusion of the C- terminus of the 
extracellular domain of a pTK, to the N-terminus of the C-terminal portion 
of an antibody (in particular the Fc domain), containing the effector 
functions of immunoglobulin G x . In a preferred embodiment, the entire heavy 
chain constant region is fused to the extracellular domain. In another 
preferred embodiment, a sequence beginning in the hinge region just 
upstream of the papain cleavage site (which defines IgG Fc chemically; 
residue 216, taking the first residue of heavy chain constant region to be 
114 (Kabat et al., Sequence? of Immunol oaical int^^ci- National Institutes 
of Health, Bethesda, MD, [1987]), or analogous sites of other 
immunoglobulins) is fused to the BCD of the pTK. 

In a particularly preferred embodiment, the pTK extracellular domain 
is fused to the hinge region and C H 2 and C a 3 or C B 1, hinge, C B 2 and C H 3 
domains of an IgG x , IgG, or IgG 3 heavy chain. The precise site at which the 
fusion is made is not critical, and the optimal site can be determined by 
25 routine experimentation. A principal advantage of the chimeras is that 
they are secreted into the culture medium of recombinant hosts, although 
the degree of secretion might be different for various expression systems. 

In general, the chimeras of the present invention are constructed in 
a fashion similar to chimeric antibodies in which a variable domain from 
an antibody of one species is substituted for the variable domain of 
another species. See, for example, EP 0 125 023; EP 173,494; Munro, Nature 
212: [13 December 1984]; Neuberger et al., Uafcur^ 212: [13 December 1984]; 
Sharon et al., Nature 2££: [24 May 1984]; Morrison et al., Proc. Matn. 
Acad, Sci, IZSfl 11:6851-6855 [1984]; Morrison et al. £cjLsnc£ 221:1202-1207 
[1985]; Boulianne et al., Nature 212:643-646 [13 December 1984]; Capon et 
al., nature 212, 525-531 [1989]; Traunecker et al., Hatliis 211, 68-70 
[1989] . 
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To prepare the pTK-Ig chimeric polypeptides, the dna including a 
region encoding the desired pTK sequence is cleaved by a restriction enzyme 
at or proximal to the 3' end of the DNA encoding the immunoglobulin -like 
domain (s) and at a point at or near the DNA encoding the N- terminal end of 
5 the mature pTK (where use of a different leader is contemplated) or at or 
proximal to the N- terminal coding region for the pTK (where the native 
signal is employed) . This DNA fragment then is readily inserted proximal 
to DNA encoding an immunoglobulin light or heavy chain constant region and, 
if necessary, the resulting construct tailored by deletional mutagenesis. 
10 Preferably, the Ig is a human immunoglobulin when the variant is intended 
for in vivo therapy for humans. DNA encoding immunoglobulin light or heavy 
chain constant regions is known or readily available from cDNA libraries 
or is synthesized. See for example, Adams et a!., Biochemistry 19:2711- 
2719 [1980]; Gough et al., Biochemistry 19: 2702-27in [1980]; Dolby et al., 
15 P.N.A.g, ggft, 77:6027-6031 [1980]; Rice et al . , P.N.A.S. USA 79:7B62-7fifiS 
[1982]; Falkner et al., Ha£ujr£ 22fi:286-288 [1982]; and Morrison et al., 
Ann. Rev. Immunol. 2:239-256 [1984] . 

The chimeric proteins disclosed herein are useful as diagnostics for 
isolating or screening ligands for the pTK of interest using the techniques 
20 of Lyman et al., Cell 25:1157-1167 [1993], for example. Also, the chimeric 
proteins are useful for diagnostic purposes for studying the interaction 
of various ligands with the extracellular domain of the various pTKs (see, 
e.g., Bennett et al., J. Biol, ch^m 2££liAi: 23060-23067 [1991]). The 
chimeric proteins are further useful for the production of antibodies 
25 against the extracellular domain of the pTK (see Examples 3 and 5 herein) . 
The chimeric proteins also have an additional therapeutic utility insofar 
as they provide a soluble form of the extracellular domain of the pTK which 
generally has an enhanced plasma half life (compared to the extracellular 
domain only) and therefore can be formulated in a pharmaceutically 
30 acceptable carrier and administered to a patient. The chimeric proteins are 
believed to find use as therapeutic agents for removal of excess systemic 
or tissue- localized pTK ligand which has been administered to a patient. 
Removal of excess ligand is particularly desirably where the ligand may be 
toxic to the patient. The chimeric protein acts to bind the ligand in 
35 competition with the endogenous pTK in the patient. Similarly, it is 
contemplated that the chimeric protein can be administered to a patient 
simultaneously, or subsequent to, administration of the ligand in the form 
of a sustained release composition. The chimeric protein acts as a soluble 
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binding protein for the ligand, thereby extending the half-life of the 
ligand. 

The term "antibody" is used herein in the broadest sense and 
specifically covers polyclonal antibodies, monoclonal antibodies, 
5 immunoglobulin chains or fragments thereof, which react immunologically 
with a pTK. 

t In the preferred embodiment of the invention, the antibodies are 

monoclonal antibodies produced using techniques which are well known in the 
art. For example, the hybridoma technique described originally by Kohler 

10 and Milstein, Eur. J. Immunol. , £:511 [1976], and also described by 
Hammerling et al. ( In: Monoclonal Antibodi es and T-Cell Hvbridomas . 
Elsevier, N.Y., pp. 563-681 [1981] can be used. The techniques of Cote et 
al. and Boerner et al. are also available for the preparation of human 
monoclonal antibodies [Cote et al., Monoclonal Antibodies and Cancer 

15 Therapy. Alan R. Liss, p. 77 [1985] and Boerner et al., J. Immunol. . 

117111:86-95 f 1991 l > • 

The term "monoclonal antibody" as used herein refers to an antibody 
(as hereinabove defined) obtained from a population of substantially 
homogeneous antibodies, i.e., the individual antibodies comprising the 

20 population are identical except for possible naturally occurring mutations 
that may be present in minor amounts. Monoclonal antibodies are highly 
specific, being directed against a single antigenic site. Furthermore, in 
contrast to conventional (polyclonal) antibody preparations which typically 
include different antibodies directed against different determinants 

25 (epitopes) , each monoclonal antibody is directed against a single 
determinant on the antigen. In addition to their specificity, the 
monoclonal antibodies are advantageous in that they can be synthesized by 
a hybridoma culture, uncontaminated by other immunoglobulins. 

"Humanized" forms of non-human (e.g., murine) antibodies are 

30 immunoglobulins, immunoglobulin chains or fragments thereof (such as Fv, 
Fab, Fab', F(ab') 2 or other antigen- binding subsequences of antibodies) 
which contain minimal amino acid residues derived from a non-human 
immunoglobulin. For the most part, humanized antibodies are human 
immunoglobulins (recipient antibody) in which residues from a complementary 

35 determining region (CDR) of the recipient are replaced by residues from a 
CDR of a non-human species (donor antibody) such as mouse, rat or rabbit 
having the desired specificity, affinity and capacity. In some instances, 
Fv framework region (FR) residues of the human immunoglobulin are replaced 
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by corresponding non-human FR residues. Furthermore, a humanized antibody 
may comprise residues which are found neither in the recipient antibody nor 
in the imported CDR or framework sequences. These modifications are made 
to further refine and optimize antibody performance. 
5 The monoclonal antibodies herein include hybrid (chimeric) and 

recombinant antibodies produced by splicing a variable (including 
hypervariable) domain of an anti-pTK antibody with a constant domain (e.g., 
"humanized" antibodies), only one of which is directed against a pTK, or 
a light chain with a heavy chain, or a chain from one species with a chain 
10 from another species, or fusions with heterologous proteins, regardless of 
species of origin or immunoglobulin class or subclass designation, so long 
as they are able to bind to the pTK of interest [See, e.g., Cabilly, et 
al., U.S. Patent No. 4,816,567; and Mage & Lamoyi, in Monoclonal Antibody 
Production Techniques and Applications, pp. 79-97 (Marcel Dekker, Inc., New 
15 York [1987]) . 

For "chimeric" and "humanized" antibodies see, for example, U.S. 
Patent No. 4,816,567; WO 91/09968; EP 452,508; and WO 91/16927. 

Thus, the modifier "monoclonal" indicates the character of the 
antibody as being obtained from a substantially homogeneous population of 
20 antibodies, and is not to be construed as requiring production of the 
antibody by any particular method. 

In the most preferred embodiment of the invention, the antibodies are 
agonist antibodies. By "agonist antibody" is meant an antibody which is 
able to bind to, and activate, a particular pTK. For example, the agonist 
25 may bind to the extracellular domain of the pTK and thereby cause 
dimerization of the pTK, resulting in transphosphorylation and activation 
of the intracellular catalytic kinase domain. Consequently, this may result 
in stimulation of growth and/or differentiation of cells expressing the 
receptor in vitro and/or in vivo. The agonist antibodies herein are 
30 preferably against epitopes within the extracellular domain of the pTK, and 
preferably have the same biological characteristics as the monoclonal 
antibody produced by the hybridoma cell line deposited under American Type. 
Culture Collection Accession No. ATCC HB 11,583. By "biological 
characteristics" is meant the in vitro and/or in vivo activities of the 
35 monoclonal antibody, e.g., ability to activate the kinase domain of a 
particular pTK, ability to stimulate cell growth and/or differentiation of 
cells expressing the pTK, and binding characteristics of the antibody, etc. 
Accordingly, the antibody preferably binds to substantially the same 
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epitope as the anti-HpTK5 monoclonal antibody specifically disclosed 
herein. Most preferably, the antibody will also have substantially the 
same or greater antigen binding affinity of the anti-HpTK5 monoclonal 
antibody disclosed herein. To determine whether a monoclonal antibody has 
5 the same specificity as the anti-HpTK5 antibody specifically disclosed 
(i.e., the antibody having the ATCC deposit No. HB 11,583), one can, for 
example, use a competitive EL ISA binding assay. 

DNA encoding the monoclonal antibodies useful in the method of the 
invention is readily isolated and sequenced using conventional procedures 

10 (e.g., by using oligonucleotide probes that are capable of binding 
specifically to genes encoding the heavy and light chains of murine 
antibodies) . The hybridoma cells of the invention serve as a preferred 
source of such DNA, Once isolated, the DNA may be placed into expression 
vectors, which are then transfected into host cells such as E. coli cells, 

15 simian COS cells, Chinese Hamster Ovary (CHO) cells, or myeloma cells that 
do not otherwise produce immunoglobulin protein, to obtain the synthesis 
of monoclonal antibodies in the recombinant host cells. 

The agonist antibodies disclosed herein are useful for in vitro 
diagnostic assays for activating the pTK receptor of interest. This is 

20 useful in order to study the role of the receptor in cell growth and/or 
differentiation . 

The pTK agonist antibodies have a further therapeutic utility in a 
method for enhancing cell growth and/or differentiation comprising 
administering to a human patient in need of such treatment a 

25 physiologically effective amount of an exogenous pTK agonist antibody. 
Agonist antibodies to the SAL-SI pTK may find utility in treating bleeding 
disorders and anemias, since this pTK was found to be expressed in 
megakaryocytic cells. The bpTK agonist antibodies may similarly be used to 
e nh a n ce differentiation and/or proliferation of brain cells in 

30 neurodegenerative diseases (such as Alzheiraers disease) based on the 
expression of these receptors in brain tissue. Finally, HpTK5 agonist 
antibodies may be used to enhance proliferation of primitive hematopoietic 
cells in patients having undergone chemo- or radiation therapy or bone 
marrow transplantation. 

35 An "exogenous" therapeutic compound is defined herein to mean a 

therapeutic compound that is foreign to the mammalian patient, or 
homologous to a compound found in the mammalian patient but produced 
outside the mammalian patient. 
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The antibodies of the present invention are also suitable for 
detecting a pTK by contacting a source suspected to contain the pTK with 
a detectably labeled monoclonal antibody, and determining whether the 
antibody binds to the source. There are many different labels and methods 
5 of labeling known in the art. Suitable labels include, for example, 
enzymes, radioisotopes, fluorescent compounds, chemi- and bioluminescent 
compounds, paramagnetic isotopes. The pTK may be present in biological 
samples, such as biological fluids or tissues. For analytical or 
diagnostic purposes, the antibodies of the present invention are 
10 administered in an amount sufficient to enable the detection of a site on 
a pTK for which the monoclonal antibody is specific. The concentration of 
the detectably labeled monoclonal antibody should be sufficient to give a 
detectable signal above background, when bound to a pTK epitope. 

The pTK agonist antibodies disclosed herein may be administered to 
15 a mammal, preferably a human, in a pharmaceutically acceptable dosage form, 
including those that may be administered to a human intravenously as a 
bolus or by continuous infusion over a period of time, by intramuscular, 
subcutaneous, intra -articular, intrasynovial , intrathecal, oral, topical, 
or inhalation routes. 
20 Such dosage forms encompass pharmaceutically acceptable carriers that 

are inherently nontoxic and nontherapeutic . Examples of such carriers 
include ion exchangers, alumina, aluminum stearate, lecithin, serum 
proteins, such as human serum albumin, buffer substances such as 
phosphates, glycine, sorbic acid, potassium sorbate, partial glyceride 
25 mixtures of saturated vegetable fatty acids, water, salts, or electrolytes 
such as protamine sulfate, disodium hydrogen phosphate, potassium hydrogen 
phosphate, sodium chloride, zinc salts, colloidal silica, magnesium 
trisilicate, polyvinyl pyrrolidone, cellulose -based substances, and* 
polyethylene glycol. Carriers for topical or gel-based forms of antibody 
30 include polysaccharides such as sodium carboxymethylcellulose or 
methylcellulose, polyvinylpyrrolidone , polyacrylates , polyoxyethylene- 
polyoxypropylene-block polymers, polyethylene glycol, and wood wax 
alcohols. For all administrations, conventional depot forms are suitably 
used. Such forms include, for example, microcapsules, nano- capsules, 
35 liposomes, plasters, inhalation forms, nose sprays, and sublingual tablets. 
The antibody will typically be formulated in such vehicles at a 
concentration of about 0.1 mg/ml to 100 mg/ml. 
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Pharmaceutical compositions may be prepared and formulated in dosage 
forms by methods known in the art; for example, see Remington's 
Pharmaceutical Sciences, Mack Publishing Company, Easton, Pennsylvania, 
15th Edition 1975. 

5 An effective amount of the pTK agonist antibody to be employed 

therapeutically will depend, for example, upon the therapeutic objectives, 
the route of administration, and the condition of the patient. 
Accordingly, it will be necessary for the therapist to titer the dosage and 
modify the route of administration as required to obtain the optimal 

10 therapeutic effect. A typical daily dosage might range from about 1 ftg/kg 
to up to 1000 mg/kg or more, depending on the factors mentioned above. 
Typically, the clinician will administer the molecule until a dosage is 
reached that achieves the desired effect. The progress of this therapy is 
easily monitored by conventional assays. 

15 Depending on the type and severity of the disease, from about 0.001 

mg/kg to about 1000 mg/kg, more preferably about 0.01 mg to 100 mg/kg, more 
preferably about 0.010 to 20 mg/kg of the agonist antibody might be an 
initial candidate dosage for administration to the patient, whether, for 
example, by one or more separate administrations, or by continuous 

20 infusion. For repeated administrations over several days or longer, 
depending on the condition, the treatment is repeated until a desired 
suppression of disease symptoms occurs or the desired improvement in the 
patient's condition is achieved. However, other dosage regimens may also 
be useful. 

25 The present invention will now be illustrated by the following 

Examples, which are not intended to be limiting in any way. The 
disclosures of all literature references cited in the specification are 
expressly incorporated herein by reference. 

EXAMPLE 1 

30 IDENTIFICATION AND ISOLATION OF DTK GENES 

To facilitate the isolation and identification of these novel pTK 
genes, two sets of DMA probes were generally used (see Table 1) . 

The first set consisted of two degenerate oligonucleotide sequences, 
pTK 1 (SEQ ID NO: 1) and pTK 2 (SEQ ID NO: 2) . These sequences were used 
35 as polymerase chain reaction (PCR) primers, using standard PGR techniques, 
to amplify tyrosine kinase DNA segments. 
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The second set consisted of two oligonucleotide sequences, pTK 3 
(SEQ ID NO: 3) and pTKKW (SEQ ID NO: 4) selected from the highly 
conserved regions of the catalytic domains of the c-kit subgroup of 
protein tyrosine kinases. These sequences were also used as polymerase 
5 chain reaction primers in a second round of DNA amplification. Using 
this two-step amplification procedure, DNA fragments which hybridized to 
these pTK primers were identified, isolated and subsequently sequenced 
using known laboratory techniques. 



10 First fipunfl of AmpJ.if^cfttjon 

Probe name Sequence 

pTKl 5 ' -CGGATCCACAGNGACCT- 3 1 

pTK2 5 • - GGAATTCCAAAGGACCAGACGTC - 3 1 



Second Round of Am plification 
15 pTK3 (kit family specific) 5 1 -CGGATCCATCCACAGAGATGT-3 ' 

pTKKW (kit family specific) 5 ' -GGAATTCCTTCAGGAGCCATCCACTT-3 1 



EXAMPLE 2 

ISQMVTIQP AND gfflVRACTgRIgATXPy OF HpTE? 
A. DNA Amplification and Cloning of HpTK5 

20 Light density human bone marrow mononuclear cells, obtained from 

normal volunteers using Deaconess Hospital Institutional Review Board 
approved protocols and with voluntary written informed consent, were 
separated by anti-CD34 antibody (AMAC, Westbrook, ME) and immunomagnetic 
beads (Dynal, Oslo, Norway) . Plow cytometric analysis using FITC- 

25 conjugated anti-CD34 antibody (AMAC) confirmed -95% CD34 positivity of 
isolated cells. The hepatoma cell line, Hep3B, was cultured in alpha 
medium (Gibco, Grand Island, NY) supplemented with penicillin (lOOU/mL) , 
streptomycin (100/ig/mL) and 10% fetal bovine serum (Gibco) at 37°C in a 
5% C0 2 incubator. Total RNA extracted from CD34+ bone marrow mononuclear 

30 or Hep3B cells was reverse transcribed with random primers and the 
Moloney murine leukemia virus reverse transcriptase (RT) following the 
conditions of the manufacturer (Gibco-BRL) in a 20 fA reaction. PCR was 
performed on the RT reaction product in a 100/xl reaction containing 50mM 
KC1, lOmM TrisHCl (pH 8.4), l.5mM MgCl , 20 ng/ml gelatin, 0.2mM dNTPs, 
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2.5 units Taq polymerase (Perkin- Elmer/ Cetus) and 50pmol each of pTK- 
specific degenerate primers 



[pTKl 5 1 TCGGATCCACA/CGNGAC/TC/TTGGC 3' (SEQ ID NO. 35), 
pTKlB 5 1 TCGGATCCAC/TC/AGNGAC/TC/TTNGCNGC 3 f (SEQ ID NO. 36), 
5 pTK2 5 1 CTCGAATTCCA/GA/ TAA/GC/ GT/ ACCAG/CACA/ GTC 3» (SEQ ID NO. 37), 

pTK2B 5 ' CTCGAATTCCA/GA/TAT/CC/GT/ACCAT/AACA/GTC 3 • ( SEQ ID NO . 38)] 
derived from consensus regions among known pTKs as previously reported 
by others (Hanks et al., Science . 211:42-52 [1988]; Wilks, Proc. Nat. 
Acad. Sci.. USA j&£:1603-1607 [1989] ; and Matthews et al . , Cell ££:1143- 
10 1152 [1991]) . The PCR cycle was 1.5min at 95°C, 2rain at 37°C and 3 min 
at 63°C repeated 35 times. The reaction product was electrophoretically 
separated on a 2% low-melting agarose gel, purified on an Elutip-D column 

(Schleicher & Schuell) digested with EcoRl and BamHl, and subcloned into 
pUC19. 

15 Recombinants were sequenced by the Sanger dideoxy method and 

evaluated by the FASTA nucleic acid sequence analysis program. One clone 
termed HpTKS (214 bp) was radiolabeled by random priming and used to 
screen an oligo dT-primed lambda gtlO Hep3B cDNA library. DNA was 
isolated from 17 positive phage plaques and inserts were subcloned into 

20 the EcoRl site of pBluescript (Stratagene La Jolla, CA) . The largest 
insert, a 3969 bp cDNA, was sonicated to an average size of 800-2000 bp 
and cloned into the Smal site of M13. Overlapping clones were sequenced 
using the Taq Dye Primer Cycle Method (CABI) on the Catalyst 800 
Molecular Biology Lab Station (ABI) . Sequencing reactions were then 

25 analyzed on the ABI 373A Automated DNA Sequenator. 

A single full-length 3969 bp cDNA was isolated and sequenced. 
(Figures 8A-8F) . The full length clone, named hepatoma transmembrane 
jcinase (HTK) or HpTK5, included an open reading frame extending from 
nucleotide 90 to 3050 predicted to encode a 987 amino acid protein of 

30 108,270 Dal ton. The putative initiation codon is preceded by an in- frame 
stop codon beginning at base 78 . Preceding the open reading frame is a 
5* untranslated region which is GC-rich as is characteristic for many 
growth factors or growth factor receptors (Kozak, J. Cell Biol. H£:887- 
903 [1991]) . 

35 The predicted protein sequence includes a transmembrane region (aa 

538-563) which divides HpTK5 into extracellular (ECD) and intracellular 
domains (ICD) . The ECD of 538 amino acids includes a signal peptide of 
15 amino acids and a cysteine-rich box containing 20 Cys residues. In 
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addition, there are two fibronectin type III repeats spanning aa 321 to 
425 and 435 to 526. Asn at positions 208, 340 and 431 are possible sites 
for N-glycosylation. 



The putative intracellular domain (ICD) contains a kinase consensus 
5 region from position 613 through 881. This kinase region includes a * 
putative ATP-binding consensus (Gly-X-Gly-X-X-Gly) in subdomain I at 
positions 622-627. A Lys at position 647 (subdomain II) corresponds to 
an invariant Lys among tyrosine kinases thought to be critical for the 
phosphotransfer reaction. Signature regions indicative of substrate 

10 specificity suggest that HpTK5 is a tyrosine rather than a 
serine /threonine kinase. These include the sequence at positions 740-745 
in subdomain VI and the sequence at positions 783-790 in subdomain VIII. 
Tyrosine residues at positions 601, 619 and 741 are possible substrates 
for tyrosine kinase activity. 

15 The predicted amino acid sequence of HpTKS most closely resembles 

that of the subfamily originally defined by EPH. The pattern of 
expression of the EPH subfamily is suggestive of a role in 
differentiation and development. In particular, the emergence of neural 
elements corresponds with the expression of certain EPH- related genes. 

20 The EPH family receptors, Hek2 and Elk, are the most closely related pTKs 
to HpTKS. They share 79.3 and 76.5% identity within the ICD respectively 
and 45 and 42% identity within the ECD respectively. 

B. t&rpmospipe Mapping of HpTKS 

Somatic cell hybrid DNAs from a panel of 25 human-hamster cell 

25 lines (Bios, New Haven, CN) were used for chromosome localization by PCR. 
Two sets of primers from the 3* untranslated region of HpTK5 were chosen. 
PCR was performed with 250 ng DNA and 50 pmol each of the 5* and 3' 
primers, 50 mM KC1, 1.5mM MgCl 2 , 20 j*g/ml gelatin, 0.2 mM dNTPs and 2.5 
units Taq polymerase in a final volume of 100 jtl. Cycles of 94°C for 30 

30 sec, 60°C for 30 sec and 72°C for 30 sec were repeated 30 times. A 
portion of each sample (15 pi) was electrophoresed through a 1.5% agarose 
gel, transferred to a nylon membrane and hybridized to a 3 2 P- labelled 
full length HpTKS cDNA probe prior to 5 hour autoradiography. Positives 
were scored and compared to a matrix summary of human chromosomal 

35 material present in each of the somatic cell hybrid DNAs. 

0 

The 3 * -untranslated region characteristically contains few, if any, 
intervening sequences and has a high degree of diversity among members 
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of gene families making it preferred in this type of analysis. Both sets 
of primers gave results that were consistent with human chromosome 7 
only. Human chromosome 7 also includes the genes for the EGP receptor, 
hepatocyte growth factor (HGF) receptor, HGF, platelet -derived growth 



5 factor (PDGF) and interleukin-6 . Karyotypic abnormalities involving this 
chromosome are common among human leukemias, particularly in aggressive 
myeloid leukemias that occur following radiation, alkylating agent 
chemotherapy or a pre-existing myelodysplastic condition (Baer et al., 
Cttrr, Qpin, Qncp; t 1:24-32 [1992] ) . 

io c. Northern Blotting of HpTKS 

Poly- A selected RNA was elect rophoresed through a 1.2% agarose, 
2.2M formaldehyde gel and transferred to a nylon filter. Prepared or 
commercially obtained filters were hybridized in 50% formamide at 42° C 
to 52 -P labeled HpTKS, glycer aldehyde- 3 -phosphate dehydrogenase (GAPDH) 

15 or actin cDNA inserts and washed under stringent conditions (final wash: 
0.1 x SSC, 0.2% SDS at 65 e C) . SSC is 0.15 M NaCl/ 0.015M Na 3 citrate, pH 
7.6. Northern blots of human fetal or adult tissue RNA were obtained from 
Clontech (Palo Alto, CA) and contained 2 jig/lane of poly A selected SNA. 
Northern blot analysis of human fetal tissues revealed a single 

20 transcript of -4Kb in heart, lung, liver and kidney, with a lesser signal 
detectable in brain. In adult human tissue, no signal was detectable in 
brain, while placenta had a particularly intense signal followed by 
kidney, liver, lung and pancreas. Skeletal muscle and heart were of 
lower signal intensity. 

25 HpTKS expression in human tumor cell lines was also analyzed by 

Northern blot analysis performed as discussed above. Cell lines derived 
from liver, breast (MCF 7) , colon (Colo 205) , lung (NCI 69) , melanocyte 
(HM-1) or cervix (HeLa) had detectable signal of appropriate size. 
Message was present in select cell lines of hematopoietic origin. K562 

30 (a primitive myeloid cell with multipotential) , THP-l (a monocytoid 
cell), U937 (a myelomonocytic cell line), Hep3B (a human hepatocarcinoma 
cell line) , and CMK (of megakaryocytic origin) were all positive for 
HpTKS message, but lymphoid (H9, Jurkat, JH-1, Raji, Ramos) or select 
other myeloid cells (KG-1 or KMT2) had no detectable transcript by 

35 Northern analysis. 

Differential expression of the HpTKS transcript in fetal versus 
adult brain suggests that HpTK5 may share, with other EPH subfamily 
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members, a role in events related to neural development. However, unlike 
some members of the EPH subfamily which are exclusively expressed in 
neurons (Maisonpierre et al., supra), HpTKS is widely expressed in other 
tissues. In particular, HpTKS is expressed in hematopoietic cells 
5 including CD34+ hematopoietic progenitor cells. The presence of the 
HpTKS message in early hematopoietic cells and cell lines of myeloid 
lineage, but not in cell lines derived from lymphoid cells, suggests that 
HpTKS may have lineage restricted expression. 



An HpTKS extracellular domain (ECD) -human IgGx Fc fusion gene was 
constructed and fusion protein produced as previously described (Bennett 
et al., J. Biol. Chem. 266 :23060-23067 T199H ) . Polyclonal antibodies 
were generated in New Zealand White rabbits against the fusion protein; 

15 Afjg in 100/iL PBS was emulsified with lOO^L Freund's adjuvant (complete 
adjuvant for the primary injection and incomplete adjuvant for all 
boosts) . For the primary immunization and the first boost, the protein 
was injected directly into the popliteal lymph nodes (Sigel et al., 
Methods Enzvmol . 22:3-12 [19633 ) . For subsequent boosts, the protein was 

20 injected into subcutaneous and intramuscular sites. 1.3 i*g protein/kg 
body weight was injected every 3 weeks with bleeds taken 1 and 2 weeks 
following each boost. HpTKS specificity of the immunized rabbit serum 
was assessed by flow cytometric analysis of NIH3T3 cells transf ected with 
full length HpTKS or vector alone using a 1:200 dilution of pre-immune 

25 serum or anti-HpTK5-IgG Fc serum. Significant peak shifts were observed 
in several HpTKS expressing clones as compared to either pre-immune serum 
or vector alone transf ectant controls. 



UTILITY AND A GONIST ACTIVITY OF POLYCLONAL ANTIBODIES TO HPTKS 
30 A. FLAG - HpTKS Fusion Construct 

Overlapping oligonucleotides encoding a 12 amino acid peptide 
having the sequence MDYKDDDDKKLAM (SEQ ID NO: 39) which includes the 4 
amino acid antibody recognition site "FLAG" (IBI, New Haven, CT) a 5'- 
EcoRV restriction site and a 3'-NcoI restriction site 
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(5 ' -CCGGATATCATGGACTACAAGGACGACGATGACAAGAAGCTTGCCATGGAGCTC; SEQ ID NO: 
40) , were ligated into the Ncol site (base 88) of HpTK5 in the EcoRV 
digested Bluescript (Stratagene, La Jolla, CA) vector. 



B. ?n vjtm Transgrj.pt j,pn anfl Translation 

5 Transcription was performed on 2 pmol of linearized HpTKS or FLAG- 

HpTK5 containing plasmid at 37°C for 1 h in 50 ill volume containing 10 
mM dithiothreitol, 2.5 ptg bovine serum albumin, 0.25 mM each dNTP, 0.5 
M m7GRNA cap (New England Biolabs, Beverly, MA), 2.5 units RNasin 
(Promega, Madison, WI) , 3 units T3 RNA polymerase (Pharmacia, Piscataway, 

10 NJ) . 1 fig of DNAase (New England Biolabs, Beverly MA) was added for 15 
min at 37°C prior to phenol /chloroform extraction and ethanol 
precipitation. Translation was performed using the Promega rabbit 
reticulocyte lysate kit according to the manufacturer's specifications 
with or without 3S S -methionine (350 fiCi) labeling. Sample buffer 

15 containing SDS and beta-mercaptoethanol (2-ME) was added before boiling 
and 10% SDS -PAGE. 

C. HpTK5 Expression in NIH3T3 Cells 

A 4038 bp Clal - Xbal cDNA fragment containing 32 bp of linker 
sequence, 37 bp of pBluescript (Stratagene La Jolla, CA) polylinker and 

20 the entire 3969 bp HpTKS cDNA was subcloned into the expression vector 
pRIS (Genentech, Inc.) under the control of the Rous sarcoma virus LTR 
promoter. NIH3T3 cells maintained in high glucose Dulbecco's Modified 
Eagle's Medium (DMEM) supplemented with 10% FCS were co-transfected with 
pRIS-HpTK5 and pNeo (an SV40 based vector containing the neomycin 

25 resistance marker) by the calcium phosphate method as described by Gorman 
et al., in DNA Prot. Engineer. Tegh. 2:3-10 [1990] . Neomycin resistant 
colonies were selected 48 hours after transfection with Geneticin 
(Gibco/BRL) at 400 ptg/ml. Fourteen days later individual resistant 
colonies were isolated, expanded and analyzed by flow cytometry for HpTK5 

30 expression using rabbit polyclonal antiserum. 

D. Immunoprecipitation 

Cells (Hep3B, control NIH3T3 or HpTKS transfected NIH3T3) or in 
vitro translated protein (HpTKS or FLAG-HpTK5) were used for 
immunoprecipitation with either serum (pre-immune or anti-HpTK5-IgG Fc) 
35 or monoclonal antibody (FLAG- specific, M2, or isotype control) (IBI, 
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Rochester, NY) . Subconf luent cells were labeled with 200/iCi/ml 35 S- 
methionine for 18 hours and lysed in lysis buffer (150 mM NaCl, 50 mM 
Tris-HCl pH8.0, 1 mM EDTA, 0.025 Na azide, 1% NP-40, 0.1% SDS, 10% 
Glycerol, 0.5% Na deoxycholate, 1 mM phenylmethylsulfonyl flouride 



5 (PMSF) , 10 fig/ml aprotinin, 10 jig/ml leupeptin and 50 /xM Na vanadate) for 
30 min on ice. The cell lysate was centrifuged (12,000 X g) for 10 min 
at 4°C. Cell lysate supernatant or in vitro translation mixture was 
precleared with 0.05 volume of normal rabbit serum and adsorbed with 0.05 
volume of Staphylococcus aureus protein-A Sepharose CL4B. After 

10 centrifugation, preimmune or immune serum (1:100 dilution), or monoclonal 
antibody, was added and rocked overnight at 4°C before 100 fil of protein- 
A Sepharose CL4B was added and the solution rocked 4°C for additional 2 
h. Immunoprecipitates were washed, suspended in SDS/PAGE loading buffer 
(10% glycerol, 5% 2-MB, 2.3% SDS and 62.5mM Tris-HCl pH 6.8), heated to 

15 95°C for 5 min and analyzed by 7.5% SDS-PAGE. 



H. Cell Fractionation 

Cell fractionation of Hep3B cells was performed to confirm the 
membrane localization of HpTKS predicted by its amino acid sequence . Hep- 
3B cells (lxlO 7 ) were labeled with 200/iCi/ml 35 S -methionine in alpha MEM 

20 medium containing 10% dialyzed FCS overnight. The cells were washed twice 
with cold PBS, scraped into 1ml of cold buffer (20mM Tris-HCl pH 7.5, 2mM 
EDTA, 5mM EGTA, 0.25M sucrose, 0.01% leupeptin, 4mM PMSF, lOmM 2-ME) and 
disrupted by sonication for 40 seconds. Whole homogenates were 
centrifuged at 12,000 X g for 15 min, the nuclear pellets isolated and 

25 the decanted supernatant centrifuged at 140,000 X g for 40 min at 4°C to 
pellet membranes. The resultant supernatant served as the cytosolic (c) 
fraction. Nuclear (N) and membrane (M) fractions were washed and 
dissolved in buffer containing 0.5% NP-40 prior to immunoprecipitation. 
The C, N or M fractions were immunoprecipitated with an anti -HpTKS or 

30 pre-immune (control) serum, subjected to 12% SDS-PAGE and 
autoradiographed. HpTKS segregated predominantly with the membrane 
fraction, though immunoprecipitated material was evident to a lesser 
extent in cytosol. 



F. Protein Kinase Assay 

Immunoprecipitates were washed once with kinase buffer (25mM Hepes 
pH7.4, lmM DTT, lOmM MgCl, lOmM MnCl) , and resuspended in 40/zl of kinase 
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buffer containing either unlabeled ATP or 10/iCi of "P-ATP (3000Ci/mM) . 
After a lOmin incubation at 30°C, the reaction was stopped by adding 40^1 
of 2 X sample buffer and boiling the samples for 3min prior to 
electrophoresis on 8.0% SDS-PAGE gel. The dried gel was covered with 4 



5 sheets of aluminum foil to block 35 S- labelled protein autoradiography and 
the gel was placed under film for 5 hours to overnight. 

6. Western Blotting and Phosphotvrosine Assay 

Proteins were electrophoretically transferred to a 0.2 fim 
nitrocellulose (Bio-Rad) or a 0.45/zm polyvinyl idene diflouride 

10 (Millipore) membrane in a buffer containing 25 mM Tris-HCl (pH 7.5) , 192 
mM glycine and 20% methanol at 100 mA for 2 h. Filters were washed in 
TBS (10 mM Tris-HCl pH 8.0, 150 mM NaCl) blocked by incubating in TBST 
(TBS with 0.05% Tween-20) plus 5% BSA overnight. Filters were washed 
four times for 5 min each in TBST and incubated for 2 h with 4G10 anti- 

15 phosphotvrosine antibody from UBI (1:1000 dilution in TBST). Filters 
were washed four times for 5 min each in TBST and incubated for l h with 
the alkaline phosphatase labelled anti-mouse secondary antibody (Promega) 
at a 1:7500 dilution in TBST. After washing four times, the blot was 
developed for 30-60 min in AP buffer (lOOmM Tris-HCl, 100 mM NaCl, 5 mM 

20 MgCl 2 ) plus BCIP, NBT substrates. 

H. Antibody Induced Phosphorylation Assay 

Rabbit antisera to HpTKS - IgG Fc were tested for their ability to 
induce HpTKS phosphorylation in HpTK5 transfected NIH3T3 cells. Cells 
were plated at a density of 5 x 10 s cells/well in a 6-well plate and, 
after 24 hours, were serum starved for 1 hour prior to adding pre-immune 
or immune serum at a 1:50 dilution for 30 minutes. Cells were then 
washed in PBS and lysed in either 2X sample buffer or NP-40 lysis buffer 
as described above. Either crude lysates or immunoprecipitated cell 
lysates were then separated via 4-12% gradient SDS-PAGE and analyzed by 
ant i -phosphotvrosine immunoblot as described above. HpTKS expressing 
cells were exposed to antisera and separated by SDS-PAGE either with or 
without immunoprecipitation. The electrotransf erred gel was immunoblotted 
with anti-phosphotyrosine antibody. Enhanced tyrosine phosphorylation of 
HpTKS was observed following exposure to polyclonal antiserum showing an 
agonist-like effect of antibody binding. Interaction of HpTK5 with an 
antibody directed against its ECD induces phosphorylation. This provides 
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further support that HpTK5 may serve as a receptor for a ligand that 
triggers kinase activation. Details of the signaling pathway of HpTK5 may 
be further explored using antisera as a surrogate ligand. 



I. Conclusions 

5 An HpTK5 ECD-IgG Fc fusion protein was expressed, purified and used 

to generate rabbit anti-serum which immunoprecipitated a l20kD protein 
from Hep3B cells. The specificity of the antiserum was confirmed by 
immunoprecipitation of in vitro translated HpTK5 RNA and HpTK5 
transfected NIH3T3 cells. To determine the functional capacity of HpTKS, 

10 in vitro translated HpTK5 was immunoprecipitated, exposed to kinase 
conditions and immunoblotted using a phospho tyrosine specific monoclonal 
antibody. The data obtained indicated that HpTK5 is phosphorylated on 
tyrosine. However, the presence of other bands consistently appearing in 
the "P-labelled immunoprecipitation suggested that HpTK5 protein was 

15 only partially purified and therefore, it could not be concluded that 
HpTKS was enzymatically active. To overcome this problem, a fusion 
construct was generated in which an 8 amino acid epitope (FLAG) was added 
to the N- terminus of HpTKS. The FLAG-HpTK5 fusion was in vitro 
translated and immunoprecipitated with a FLAG- specific monoclonal 

20 antibody resulting in a single protein of appropriate size (~120kD) . 
When subjected to kinase conditions in the presence of "P-ATP, the 
HpTK5 - FLAG fusion protein was labelled on tyrosine confirming tyrosine 
aut ©phosphorylation and thereby, the kinase function of HpTKS. 



EXAMPLE 5 

25 PRODUCTION OF MONOCLONAL ANTIBODIES TO HPTKS 

Ant i- HpTKS monoclonal antibodies were produced by hyper immunizing 
BALB/c mice intraperitoneal ly with the HpTKS extracellular domain (ECD) - 
human IgG x Fc fusion protein (produced using the techniques disclosed 
above) in RIBI adjuvant (RIBI ImmunoChem Research, Hamilton, MT) and 

30 fusing splenocytes with the mouse myeloma cell line X63-Ag8.653 (Kearney 
et al., J. Immunol. 123 :1548-1550 [1979]). The antibodies were purified 
from ascites fluid using protein A-Sepharose (Repligen Corp., Cambridge, 
MA) and established affinity chromatography methods (Goding, J.W., ^ 
Immunol. Methods 20:241-253 [1978]). 

35 Monoclonal antibodies were screened for their ability to bind the 

HpTK5 antigen. Starting on day 15 post fusion, culture supernatant s were 
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harvested from the fusion plates and assayed for their ability to 
specifically "capture" HpTKS- IgG. In this ELISA assay, goat anti-mouse 
IgG was coated onto 96 well microtiter plates. The culture supernatants 
(100^1) were added to the wells and the mouse IgG present was bound by 
the goat anti -mouse IgG antibodies. The plates were washed and either 
HpTKS - IgG or CD4-IgG (100/il at 6nM) was added. The "captured" 
immunoadhesin was detected using a goat anti-hu (Pc specific) horseradish 
peroxidase conjugate and orthophenylene diamine substrate. Quantitation 
of substrate catalysis was determined by optical density at 490nm. 

Agonist antibodies were then screened for using the techniques 
disclosed in Example 6 below. Two agonist monoclonal antibodies were 
identified, one of which has been deposited with the ATCC. 

AGONIST ACTIVITY OF MONOCLONE, ANTIBODIES TO HPT** 

The monoclonal antibodies produced using the techniques disclosed 
in Example 5 were tested for their ability to induce HpTK5 
phosphorylation in HpTKS transfected NIH3T3 cells. Cells were plated at 
a density of 5 x 10 s cells/well in a 6-well plate and, after 24 hours, 
were serum starved for 1 hour prior to adding pre- immune serum or anti- 
HpTKS monoclonal antibody (undiluted conditioned hybridoma media was 
used) for 30 minutes. Cells were then washed in PBS and lysed in either 
2X sample buffer or NP-40 lysis buffer as described above. Either crude 
lysates or immunoprecipitated cell lysates were then separated via 4-12% 
gradient SDS-PAGE and analyzed by anti-phosphotyrosine immunoblot as 
described above. HpTKS expressing cells were exposed to the monoclonal 
antibody and separated by SDS-PAGE either with or without 
immunoprecipitation. The electrotransf erred gel was immunoblotted with 
anti-phosphotyrosine antibody. En ha n ced tyrosine phosphorylation of HpTKS 
was observed following exposure to monoclonal antibodies showing an 
agonist-like effect of antibody binding. Accordingly, interaction of 
HpTK5 with a monoclonal antibody directed against its ECD is able to 
induce phosphorylation of the kinase domain thereof. 

EXAMPLE? 7 

PEQPPCnON OF POLYCLONAL ANTIBODIES TO fl&T-cp 
A SAL-SI extracellular domain (ECD) -human IgG x Fc fusion gene was 
constructed and fusion protein produced as previously described in 
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Bennett et al. t J. Biol. Chem. 2££:23060-23067 [1991]. Briefly, PCR 
primers otk 1.41.1 (SEQ ID NO: 43) and otk 1.41.2 (SEQ ID NO: 44) were 
employed in the PGR technique using plasmid pRK5.tkl-l.l (SEQ ID NO: 45) 
containing SAL-SI nucleic acid as a template to create a DNA fragment 



5 which, when digested with Sall/BstEII, generated an 155bp Sall/BstEII 
fragment. This 155bp fragment was combined with a 6839bp Sall/Hindlll 
fragment isolated from pRK5.tkl-l.l and a 719 bp BstEII/Hindlll fragment 
isolated from pBSSK-CH2-CH3 (Bennett et ai., supra). These fragments 
were ligated together to create a plasmid pRK5.tkl.igl.! (7713bp in size) 

10 which, when transfected into 293 cells, was used to produce a SAL-SI 
extracellular domain (ECD) -human IgG Fc fusion protein. Fusion protein 
was prepared and purified as described in Bennett et al., supra. 
Polyclonal antibodies were generated in female New Zealand White rabbits 
against the fusion protein. Briefly, 12.5A«g of fusion protein in 0.625ml 

15 PBS was emulsified with 0.625ml Freund's adjuvant (complete adjuvant for 
the primary injection and incomplete adjuvant for all boosts) . The 
primary injection and all boosts were intramuscular at two sites and 
subcutaneous at multiple sites. Boosts were carried out at 3 week 
intervals with bleeds taken 1 and 2 weeks following each boost. SAL-SI 

20 specificity of the immunized rabbit serum was assessed by flow cytometric 
analysis of 293 (ATCC CRL 1593) and COS7 (ATCC CRL 1651) cells 
transfected with full length SAL-SI or vector alone (see below) using a 
1:200 dilution of pre -immune serum or anti-SAL-Sl-IgG Fc serum. 
Significant peak shifts were observed in several SAL-SI expressing clones 

25 as compared to either pre -immune serum or vector alone trans fee tant 
controls . 



EXAMPLE 8 

UTILITY AND AGONIST ACTIVITY OF SAL-SI POL YCLONAL ANTIBODIES 
A. ynrmunoprecj.p^tatioq 

30 Control 293 and COS7 cells as well as SAL-SI transfected 293 and 

C0S7 cells were used for immunoprecipitation with either pre-immune serum 
or anti-SAL-Sl-IgG Fc polyclonal antibody. COS7 and 293 cells were 
transfected using a CaP0 4 procedure as described by Gorman, C. DNA 
Cloning, Glover D. Ed., IRL Press, Oxford, vol2: 143-190 (1985). For 

35 transient expression, 293 cells were transfected as described by Gearing 
et al. EMBO 8: 3667-3676 (1989). Subconfluent cells were labeled with 
200/xCi/ml 3S S- methionine for 18 hours and lysed in lysis buffer (150 mM 
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NaCl, 50mM HEPES, pH 7.5, 1 mM EGTA, 0.025 Na azide, 1% Triton-X 100, 
1.5mM MgCl 2 , 10% Glycerol, 1 mM phenylmethylsulfonyl flouride [PMSF] , 10 
/zg/ml aprotinin, 10 jig/ml leupeptin and 50 /iM Na vanadate) for 10 min on 
ice. The cell lysate was centrifuged (12,000 X g) for 10 min at 4°C. 



5 After centrifugation, preimmune or polyclonal antibody was added to the 
supernatant and rocked for 4 hrs at 4°C before 100 /il of protein-A 
Sepharose CL4B was added and the solution rocked 4°C for additional 2 h. 
Immunoprecipitates were washed, suspended in SDS/PAGE loading buffer (10% 
glycerol, 5% 2-ME, 2.3% SDS and 62.5mM Tris-HCl pH 6.8), heated to 95°C 
10 for 5 min and analyzed by 7.5% SDS-PAGE. 

B. Western Blotting and Phospho tyrosine Assay 

Proteins were electrophoretically transferred to a 0.2 /zm 
nitrocellulose (Bio-Rad) or a 0.45jim polyvinyl idene diflouride 
(Millipore) membrane in a buffer containing 25 mM Tris-HCl (pH 7.5), 192 

15 mM glycine and 20% methanol at 100 mA for 2 h. Filters were washed in 
TBS (10 mM Tris-HCl pH 8.0, 150 mM NaCl) blocked by incubating in TBST 
(TBS with 0.05% Tween-20) plus 5% BSA overnight. Filters were washed 
four times for 5 min each in TBST and incubated for 2 h with 4G10 anti- 
phosphotyrosine antibody from UBI (1:1000 dilution in TBST) . Filters 

20 were washed four times for 5 min each in TBST and incubated for 1 h with 
the alkaline phosphatase labelled anti-mouse secondary antibody (Promega) 
at a 1:5000 dilution in TBST. After washing four times, the blot was 
developed for 30-60 min in AP buffer (lOOmM Tris-HCl, 100 mM NaCl, 5 mM 
MgCl 2 ) plus BCIP, NBT substrates. 

25 C. Antibody Induced Phosphoryla tion Assay 

Rabbit antisera to SAL-SI- IgG Fc were tested for their ability to 
induce SAL-Sl phosphorylation in SAL-SI transfected 293 cells. Cells were 
plated at a density of 5 x 10 s cells/well in a 6-well plate and, after 
24 hours, were serum starved for 12 hours prior to adding pre- immune or 

30 immune serum at a 1:5 dilution for 30 minutes. Cells were then washed 
in PBS and lysed in either sample buffer or Triton-X lysis buffer as 
described above. Either crude lysates or immunoprecipitated cell lysates 
were then separated via 8% or 4-12% gradient SDS -PAGE and analyzed by 
anti-phosphotyrosine immunoblot as described above. SAL-SI expressing 

35 cells were exposed to antisera and separated by SDS-PAGE either with or 
without immunoprecipitation. The electrotransf erred gel was immunoblotted 
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with ant i-phospho tyrosine antibody. Enhanced tyrosine phosphorylation of 
SAL- SI was observed following exposure to polyclonal antiserum showing 
an agonist-like effect of antibody binding. Interaction of SAL-SI with 
an antibody directed against its ECD induces phosphorylation. 



Ant i- SAL- SI monoclonal antibodies were produced by hyper immunizing 
BALB/c mice in the foot pad with the SAL- SI extracellular domain-human 
IgGi Fc fusion protein in RIBI adjuvant (RIB I Immunochem Research, 

10 Hamilton, MT) and fusing lymphocyte from lymph nodes with the mouse 
myeloma cell line X63-Ag8Ul. 

Starting on day 10 post fusion, cultured supernatant s were harvest 
from the fusion plates and assayed for their ability to bind to SAL-SI. 
In this ELISA assay, SAL-SI XgG x was coated onto 96 microtiter plates. 

15 The cultured supernatants (100/il) were added to the wells and the mouse 
antibodies present were bound to Sal-Si IgGj. The plates were washed and 
mouse IgG was detected using a goat ant i -mouse IgG (Fc specific with no 
cross reactivity against human IgG Fc) horseradish peroxidase conjugate 
and orthophenylene diamine substrate. Quantitation of substrate catalysis 

20 was determined by optical density at 490 mtw 

Cultured supernatants which were positive from ELISA were then 
tested for their ability to specifically bind to 293 transfected with 
SAL-Sl receptor and analyzed by flow cytometry. Agonist antibodies were 
then screened for using the techniques disclosed in Example 10 below. Six 

25 agonist monoclonal antibodies were identified. 

BXaMELfi 10 

AGONIST ACTIVITY OF MONOCLONAL ANTIBODI ES TO SAL-SI 

The monoclonal antibodies were tested for their ability to induce 
SAL-Sl phosphorylation in SAL-SI transfected 293 cells. Cells were 
30 harvested from tissue culture dish by assay buffer and washed 2x with the 
same buffer. 1x10 s cells were added to a 96 U-bottom plate which was 
centrifuged and assay buffer was removed. 150 (il of cultured supernatants 
was added to each well followed by incubation at 37°C for 30 minutes, the 
plate was centrifuged and cultured supernatants were removed. 100 jil of 



35 Fixing solution was added, the cells were fixed for 30 minutes at -20 °C, 
cells were washed with buffer 2x and stained with anti-phosphotyrosine 



5 
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PRODUCTION OF MONOCLONAL ANTIBODIES TO SAL-Sl 
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conjugate with FITC for 60 minutes at 4°C. Cells were analyzed by flow 
cytometry (FACScan Becton Dickinson, milplitas, CA) . The six anti-SAL-Sl 
monoclonal antibodies were able to induce SAL-SI phosphorylation in SAL- 
SI transfected 293 cells. 

Deposit of Materials 
The following culture has been deposited with the American Type 
Culture Collection, 12301 Parklawn Drive, Rockville, MD, USA (ATCC) : 



This deposit was made under the provisions of the Budapest Treaty 
on the International Recognition of the Deposit of Microorganisms for the 
Purpose of Patent Procedure and the Regulations thereunder (Budapest 
Treaty) . This assures maintenance of a viable culture for 30 years from 
the date of deposit. The organism will be made available by ATCC under 
the terms of the Budapest Treaty, and subject to an agreement between 
Genentech, Inc. and ATCC, which assures permanent and unrestricted 
availability of the progeny of the culture to the public upon issuance 
of the pertinent U.S. patent or upon laying open to the public of any 
U.S. or foreign patent application, whichever comes first, and assures 
availability of the progeny to one determined by the U.S. Commissioner 
of Patents and Trademarks to be entitled thereto according to 35 USC §122 
and the Commissioner's rules pursuant thereto (including 37 CFR §1.14 
with particular reference to 886 OG 638) . 

The assignee of the present application has agreed that if the 
culture on deposit should die or be lost or destroyed when cultivated 
under suitable conditions, it will be promptly replaced on notification 
with a viable specimen of the same culture. Availability of the 
deposited strain is not to be construed as a license to practice the 
invention in contravention of the rights granted under the authority of 
any government in accordance with its patent laws. 

The foregoing written specification is considered to be sufficient 
to enable one skilled in the art to practice the invention. The present 
invention is not to be limited in scope by the culture deposited, since 
the deposited embodiment is intended as a single illustration of one 
aspect of the invention and any culture that are functionally equivalent 
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are within the scope of this invention. The deposit of material herein 
does not constitute an admission that the written description herein 
contained is inadequate to enable the practice of any aspect of the 
invention, including the best mode thereof, nor is it to be construed as 
5 limiting the scope of the claims to the specific illustration that it 
represents. Indeed, various modifications of the invention in addition 
to those shown and described herein will become apparent to those skilled 
in the art from the foregoing description and fall within the scope of 
the appended claims. 
10 Equivalents 

Those skilled in the art will recognize, or be able to ascertain 
using no more than routine experimentation, many equivalents to the 
specific embodiments of the invention described herein. Such equivalents 
are intended to be encompassed by the following claims. 
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SEQUENCE LISTING 



(1) GENERAL INFORMATION: 



(i) APPLICANT: Genentech, Inc. 



10 



15 



20 



Bennett, Brian D. 
Goeddel, David 
Lee, James M. 
Matthews, William 
Tsai, Siao Ping 
Wood, William I. 



(ii) TITLE OP INVENTION: PROTEIN TYROSINE KINASE AGONIST ANTIBODIES 
(iii) NUMBER OF SEQUENCES: 45 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Genentech, Inc. 

(B) STREET: 460 Point San Bruno Blvd 

(C) CITY: South San Francisco 

(D) STATE: California 

(E) COUNTRY: USA 

(F) ZIP: 94080 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: 5.25 inch, 360 Kb floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS /MS-DOS 

(D) SOFTWARE: patin (Genentech) 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 

(C) CLASSIFICATION: 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: 08/222616 

(B) FILING DATE: 04 -APR- 1994 

(viii) ATTORNEY/ AGENT INFORMATION: 

(A) NAME: Wendy M. Lee 

(B) REGISTRATION NUMBER: 00,000 

(C) REFERENCE/DOCKET NUMBER: 821P3PCT 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: 415/225-1994 

(B) TELEFAX: 415/952-9881 

(C) TELEX: 910/371-7168 

(2) INFORMATION FOR SEQ ID NO:l: 

(i) SEQUENCE CHARACTERISTICS : ' 

(A) LENGTH: 17 bases 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: I: 



CGGATCCACA GNGACCT 17 



(2) INFORMATION FOR SEQ ID NO:2: 

(i) SEQUENCE CHARACTERISTICS: 
5 (A) LENGTH: 23 bases 

(B-) TYPE : nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 



10 GGAATTCCAA AGGACCAGAC GTC 23 



(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 bases 

(B) TYPE: nucleic acid 
15 (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 



CGGATCCATC CACAGAGATG T 21 



(2) INFORMATION FOR SEQ ID NO: 4: 

20 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 bases 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

25 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 



GGAATTCCTT CAGGAGCCAT CCACTT 26 



(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 160 bases 
30 (B) TYPE: nucleic acid 
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<C) STRANDEDNESS : single 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 
GGATCCTGTG CATCAGTGAC TTAGGGCTAG GAACATTCTG CTGTCGGAAA 50 
5 GCGACGTGGT GAAGATCTGT GACTTTGGCC TTGCCCGGGA CATCTACAAA 100 
GACCCCAGCT ACGTCCGCAA GCATGCCCGG CTGCCCCTGA AGTGGATGGC 150 
GCCAGAATTC 160 

(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 
10 (A) LENGTH: 53 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

Asp Pro Val His Gin Xaa Leu Arg Ala Arg Asn lie Leu Leu Ser 
15 1 5 10 is 

Glu Ser Asp Val Val Lys lie Cys Asp Phe Gly Leu Ala Arg Asp 
20 25 30 

He Tyr Lys Asp Pro Ser Tyr Val Arg Lys His Ala Arg Leu Pro 
35 40 45 

20 Leu Lys Trp Met Ala Pro Glu Phe 

50 53 

(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 147 bases 
25 (B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 
GGATCCATTC ACAGAGACCT AGCAGCACGC AACATCCTGG TCTCAGAGGA 50 
30 CCTGGTAACC AAGGTCAGCG ACTTTGGCCT GGCCAAAGCC GAGCGGAAGG 100 
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GGCTAGACTC AAGCCGGCTG CCCGTCAAAT GGATGGCTCC CGAATTC 147 



(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 
{A) LENGTH: 49 amino acids 
5 (B) TYPE: amino acid 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 

Gly Ser lie His Arg Asp Leu Ala Ala Arg Asn lie Leu Val Ser 
15 10 15 

10 Glu Asp Leu Val Thr Lys Val Ser Asp Phe Gly Leu Ala Lys Ala 

20 25 30 

Glu Arg Lys Gly Leu Asp Ser Ser Arg Leu Pro Val Lys Trp Met 
35 40 45 

Ala Pro Glu Phe 
15 49 

(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 149 bases 

(B) TYPE: nucleic acid 
20 (C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 



GTTGGAATTC CTTCCGGCGC CATCCATTTC ACCGGCAGCT TTATTTCGTG 50 



TCTAGATTCA TAGATGTCTT CATTATCTAC CTTAAAAACT CTGGCAAGTC 100 



25 CAAAATCTGC TACTTTGTAG ATATTATGTT CACCAACGAG GACATTCCT 149 



(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 47 amino acids 

(B) TYPE: amino acid 
30 (D) TOPOLOGY: linear 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

Val Gly lie Pro Ser Gly Ala He His Phe Thr Gly Ser Phe He 
1 5 10 15 
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Ser Cys Leu Asp Ser Met Ser Ser Leu Ser Thr Leu Lys Thr Leu 
20 25 30 

Ala Ser Pro Lys Ser Ala Thr Leu lie Leu Cys Ser Pro Thr Arg 
35 40 45 



5 Thr Phe 
47 



(2) INFORMATION FOR SEQ ID NO: 11: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 151 bases 
10 (B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 



GTGCACAGGG ATCTCGCGGC TCGGAACATC CTCGTCGGGG AAAACACCCT 50 



15 CTCGAAAGTT GGGGACTTCG GGTTAGCCAG GCTTATCAAG GAGGACGTCT 100 



ACCTCTCCCA TGACCACAAT ATCCCCTACA AATGGATGGC CCCTGAGGGA 150 
A 151 



(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 
20 (A) LENGTH: 50 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 

Val His Arg Asp Leu Ala Ala Arg Asn lie Leu Val Gly Glu Asn 
25 1 5 10 * 15 

Thr Leu Ser Lys Val Gly Asp Phe Gly Leu Ala Arg Leu lie Lys 
20 25 30 

Glu Asp Val Tyr Leu Ser His Asp His Asn He Pro Tyr Lys Trp 
35 40 * * 45 

30 Met Ala Pro Glu Gly 

50 

(2) INFORMATION FOR SEQ ID NO:l3: 
(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 137 bases 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

5 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 

GTTCACCGAG ATCTCAAGTC CAACAACATT TTGCTGCTGC AGCCCATTGA 50 
GAGTGACGAC ATGGAGCACA AGACCCTGAA GATCACCGAC TTTGGCCTGG 100 
CCCGAGAGTG GCACAAAACC ACACAAATGA GTGCCGC 137 

(2) INFORMATION FOR SEQ ID NO: 14: 

10 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 45 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 

15 Val His Arg Asp Leu Lys Ser Asn Asn He Leu Leu Leu Gin Pro 
15 10 15 

He Glu Ser Asp Asp Met Glu His Lys Thr Leu Lys He Thr Asp 
20 25 30 

Phe Gly Leu Ala Arg Glu Trp His Lys Thr Thr Gin Met Ser Ala 
20 35 40 45 

(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 211 bases 

(B) TYPE: nucleic acid 
25 (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 
GTCAATCGTG ACCTCGCCGC CCGAAATGTG TTGCTAGTTA CCCAACATTA 50 
CGCCAAGATC AGTGATTTCG GACTTTCCAA AGCACTGCGT GCTGATGAAA 100 
30 ACTACTACAA GGCCCAGACC CATGGAAAGT GGCCTGTCAA GTGGTACGCT 150 
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CCGGAATGCA TCAACTACTA CAAGTTCTCC AGCAAAAGCG ATGTCTGGTC 200 

CTTTGGAATT C 211 



(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 
5 (A) LENGTH: 70 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 

Val Asn Arg Asp Leu Ala Ala Arg Asn Val Leu Leu Val Thr Gin 
10 1 5 10 15 

His Tyr Ala Lys lie Ser Asp Phe Gly Leu Ser Lys Ala Leu Arg 
20 25 30 

Ala Asp Glu Asn Tyr Tyr Lys Ala Gin Thr His Gly Lys Trp Pro 
35 40 45 

15 Val Lys Trp Tyr Ala Pro Glu Cys lie Asn Tyr Tyr Lys Phe Ser 

50 55 60 

Ser Lys Ser Asp Val Trp Ser Phe Gly lie 
65 70 

(2) INFORMATION FOR SEQ ID NO: 17: 

20 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6827 bases 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

25 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 



TTCGAGCTCG CCCGACATTG ATTATTGACT AGTTATTAAT AGTAATCAAT 50 



TACGGGGTCA TTAGTTCATA GCCCATATAT GGAGTTCCGC GTTACATAAC 100 



TTACGGTAAA TGGCCCGCCT GGCTGACCGC CCAACGACCC CCGCCCATTG 150 



ACGTCAATAA TGACGTATGT TCCCATAGTA ACGCCAATAG GGACTTTCCA 200 



30 TTGACGTCAA TGGGTGGAGT ATTTACGGTA AACTGCCCAC TTGGCAGTAC 250 
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ATCAAGTGTA TCATATGCCA AGTACGCCCC CTATTGACGT CAATGACGGT 300 
AAATGGCCCG CCTGGCATTA TGCCCAGTAC ATGACCTTAT GGGACTTTCC 350 
TACTTGGCAG TACATCTACG TATTAGTCAT CGCTATTACC ATGGTGATGC 400 
GGTTTTGGCA GTACATCAAT GGGCGTGGAT AGCGGTTTGA CTCACGGGGA 450 
5 TTTCCAAGTC TCCACCCCAT TGACGTCAAT GGGAGTTTGT TTTGGCACCA 500 
AAATCAACGG GACTTTCCAA AATGTCGTAA CAACTCCGCC CCATTGACGC 550 
AAATGGGCGG TAGGCGTGTA CGGTGGGAGG TCTATATAAG CAGAGCTCGT 600 
TTAGTGAACC GTCAGATCGC CTGGAGACGC CATCCACGCT GTTTTGACCT 650 
CCATAGAAGA CACCGGGACC GATCCAGCCT CCGCGGCCGG GAACGGTGCA 700 
10 TTGGAACGCG GATTCCCCGT GCCAAGAGTG ACGTAAGTAC CGCCTATAGA 750 
GTCTATAGGC CCACTTGGCT TCGTTAGAAC GCGGCTACAA TTAATACATA 800 
ACCTTATGTA TCATACACAT ACGATTTAGG TGACACTATA GAATAACATC 850 
CACTTTGCCT TTCTCTCCAC AGGTGTCCAC TCCCAGGTCC AACTGCACCT 900 
CGGTTCTATC GATTGAATTC CCCGGGGATC CTCTAGAGAT CCCTCGACCT 950 
15 CGAGATCCAT TGTGCTGGCG CGGATTCTTT ATCACTGATA AGTTGGTGGA 1000 
CATATTATGT TTATCAGTGA TAAAGTGTCA AGCATGACAA AGTTGCAGCC 1050 
GAATACAGTG ATCCGTGCCG CCCTAGACCT GTTGAACGAG GTCGGCGTAG 1100 
ACGGTCTGAC GACACGCAAA CTGGCGGAAC GGTTGGGGGT TCAGCAGCCG 1150 
GCGCTTTACT GGCACTTCAG GAACAAGCGG GCGCTGCTCG ACGCACTGGC 1200 
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CGAAGCCATG CTGGCGGAGA ATCATAGCAC TTCGGTGCCG AGAGCCGACG 1250 
ACGACTGGCG CTCATTTCTG ACTGGGAATG CCCGCAGCTT CAGGCAGGCG 1300 
CTGCTCGCCT ACCGCCAGCA CAATGGATCT CGAGGGATCT TCCATACCTA 1350 
CCAGTTCTGC GCCTGCAGGT CGCGGCCGCA CTACTCTTTG ATGTATTACT 1400 
5 CATATTACCA AGGAATAACT GGCGGGCACA GGGTCAGGTG CTGAAGGGAC 1450 
ATTGTGAGAA GTGACCTAGA AGGCAAGAGG TGAGCCCTCT GTCACGCTGG 1500 
CATAAGGGCC GCTTGAGGGC TCTTTGGTCA AGCAGTAACG CCAGTGTCTG 1550 
GGAAGGCACC TGTTACTCAG CAGACCATGA AAGGGCGTCT CCCTTTCCTT 1600 
GGAGCAGTCA GGGAACACTC TGCTCCACCA GCTTCTTGTG GGAGCCTGGA 1650 

10 TATTATCCAG GCCTGCCCGC AGTCATCCGG AGGCCTAACC CCTCCCTGTG 1700 
GTGCTTCAGT GGTCACACTC CTTGTCCACT TTCATGCTCC TCTTGGCCTC 1750 
CTGGTTCCTC TTGGAAGTTT GTAGTAGATA GCAGAAGAAA TAGCGAAAGT 1800 
CTTAAAGTCT TTGATCTTTC TTATAAGTGC AGAGAAGAAA TGCTGACGTA 1850 
TGCTGCCTTC TCTCTCTCTG CTTCAGCTAC CTGAAGCCGC TTTCTTGTCT 1900 

15 ATACCTGCTC TCTATCTGCT CACACTCCTC CGAGGCCAGC ACCATCCCAC 1950 
TGTCTGTCTG GTTGTCCACA GAGCCTTTGT AGGTCGTTGG GGTCATGGGG 2000 
AATTCCTCAA ATGTCTTCAT CCTGGAGGAA CCACGGGTCT CAGCCCCTCT 2050 
GGCCAGGCAC CCGGGAAAGG ACACCCAGTT GTAATACCTG GCGGCCAGGC 2100 
TGTGGCGCTG CAGGCTTGGC GGGCTGTCCT CAGCGTCAGC CTGGGCGATG 2150 
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TGTAGGGCCA TGGTGGACAC CTGCGAGAAG CTGCCCTCTT CTGAGCTCTG 2200 

AGAGCTGCGC GGGGCCATGC AGACCTCCTC TTCCTCTTGC AGGCCCCTGC 2250 

CCTGGAGCAG GTCCCCCAGG ATCTCCACCA GCTCCGAGAA TGCAGGTCTC 2300 

GCCTTGGGGT CTCCGGACCA GCAGTTCAGC ATGATGCGGC GTATGGCGGG 2350 

5 AGTGGCCAGC TCCGGGGCCC TCATCCTTGT GCCGTCTCTC AGCCGCTGGC 2400 

AGAACTCCTC ATTGATCTGC ACCCCAGGGT ACGGGGAGGC CCCCAGAGAG 2450 

AAGATCTCCC AGAGAAGCAC CCCAAAGGAC CACACGTCAC TCTGCGTGGT 2500 

GTACACCTTG TCGAAGATGC TTTCAGGGGC CATCCACTTC AGGGGCAGCC 2550 

GGGCACTGCC CTTGCGGACG TAGTCGGGGT CTTTGTAGAT GTCCCGGGCA 2600 

10 AGGCCAAAGT CACAGATCTT CACCACGTCG CTTTCCGACA GCAGAATGTT 2650 
CCGAGCAGCC AGGTCTCTGT GGATGCACTT TCGGGAAGCC AGGAACTCCA 2700 
TCCCTCTGGC CACCTGGAAG CTGTAGCAGA CAAGATCTTC CATGGTCAGC 2750 
GGGCTCAGCC ACAGGTCCTC AGCTTCTTGG TCTGGAGAAG CCCGCCTCGC 2800 
TCCGCCCTCG GTCTTCGAGA ACCGCGCGAA GAGGACCCTG TCGCTGCTCC 2650 

15 CCGGCCGCCT CCGATCCAGC CTGGCGAGCT CCACCATGGC GCGGAAGCGT 2900 
CCGCGCTGCT CGGGAGACTT CTCCTGCGGA TGCACGAAGC TGGCTCGAGG 2950 
GCGCCCAGTC GTCCGCCGCA GAGGCGCCTC CATTCCCCCG CCGCCCGCGG 3000 
CGCCCCGCAG GCCGCCCGCT CACCGNGCAG GGGCTGCGGC CGCGACTCTA 3050 
GAGTCGACCT GCAGAAGCTT GGCCGCCATG GCCCAACTTG TTTATTGCAG 3100 

-42- 




WO 95/27061 PCTVUS95/04228 



CTTATAATGG TTACAAATAA AGCAATAGCA TCACAAATTT CACAAATAAA 3150 
GCATTTTTTT CACTGCATTC TAGTTGTGGT TTGTCCAAAC TCATCAATGT 3200 
ATCTTATCAT GTCTGGATCG ATCGGGAATT AATTCGGCGC AGCACCATGG 3250 
CCTGAAATAA CCTCTGAAAG AGGAACTTGG TTAGGTACCT TCTGAGGCGG 3300 
5 AAAGAACCAG CTGTGGAATG TGTGTCAGTT AGGGTGTGGA AAGTCCCCAG 3350 
GCTCCCCAGC AGGCAGAAGT ATGCAAAGCA TGCATCTCAA TTAGTCAGCA 3400 
ACCAGGTGTG GAAAGTCCCC AGGCTCCCCA GCAGGCAGAA GTATGCAAAG 3450 
CATGCATCTC AATTAGTCAG CAACCATAGT CCCGCCCCTA ACTCCGCCCA 3500 
TCCCGCCCCT AACTCCGCCC AGTTCCGCCC ATTCTCCGCC CCATGGCTGA 3550 
10 CTAATTTTTT TTATTTATGC AGAGGCCGAG GCCGCCTCGG CCTCTGAGCT 3600 
ATTCCAGAAG TAGTGAGGAG GCTTTTTTGG AGGCCTAGGC TTTTGCAAAA 3650 
AGCTGTTAAC AGCTTGGCAC TGGCCGTCGT TTTACAACGT CGTGACTGGG 3700 
AAAACCCTGG CGTTACCCAA CTTAATCGCC TTGCAGCACA TCCCCCCTTC 3750 
GCCAGCTGGC GTAATAGCGA AGAGGCCCGC ACCGATCGCC CTTCCCAACA 3800 
15 GTTGCGTAGC CTGAATGGCG AATGGCGCCT GATGCGGTAT TTTCTCCTTA 3850 
CGCATCTGTG CGGTATTTCA CACCGCATAC GTCAAAGCAA CCATAGTACG 3900 
CGCCCTGTAG CGGCGCATTA AGCGCGGCGG GTGTGGTGGT TACGCGCAGC 3950 
GTGACCGCTA CACTTGCCAG CGCCCTAGCG CCCGCTCCTT TCGCTTTCTT 4000 
CCCTTCCTTT CTCGCCACGT TCGCCGGCTT TCCCCGTCAA GCTCTAAATC 4050 
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GGGGGCTCCC TTTAGGGTTC CGATTTAGTG CTTTACGGCA CCTCGACCCC 4100 
AAAAAACTTG ATTTGGGTGA TGGTTCACGT AGTGGGCCAT CGCCCTGATA 4150 
GACGGTTTTT CGCCCTTTGA CGTTGGAGTC CACGTTCTTT AATAGTGGAC 4200 
TCTTGTTCCA AACTGGAACA ACACTCAACC CTATCTCGGG CTATTCTTTT 4250 
5 GATTTATAAG GGATTTTGCC GATTTCGGCC TATTGGTTAA AAAATGAGCT 4300 
GATTTAACAA AAATTTAACG CGAATTTTAA CAAAATATTA ACGTTTACAA 4350 
TTTTATGGTG CACTCTCAGT ACAATCTGCT CTGATGCCGC ATAGTTAAGC 4400 
CAACTCCGCT ATCGCTACGT GACTGGGTCA TGGCTGCGCC CCGACACCCG 4450 
CCAACACCCG CTGACGCGCC CTGACGGGCT TGTCTGCTCC CGGCATCCGC 4500 
10 TTACAGACAA GCTGTGACCG TCTCCGGGAG CTGCATGTGT CAGAGGTTTT 4550 
CACCGTCATC ACCGAAACGC GCGAGGCAGT ATTCTTGAAG ACGAAAGGGC 4600 
CTCGTGATAC GCCTATTTTT ATAGGTTAAT GTCATGATAA TAATGGTTTC 4650 
TTAGACGTCA GGTGGCACTT TTCGGGGAAA TGTGCGCGGA ACCCCTATTT 4700 
GTTTATTTTT CTAAATACAT TCAAATATGT ATCCGCTCAT GAGACAATAA 4750 
15 CCCTGATAAA TCTTCAATAA TATTGAAAAA GGAAGAGTAT GAGTATTCAA 4800 
ACATTTCCGT GTCGCCCTTA TTCCCTTTTT GGCGGCATTT TGCCTTCCTG 4850 
TTTTTGCTCA CCCAGAAACG CTGGTGAAAG TAAAAGATGC TGAAGATCAG 4900 
TTGGGTGCAC GAGTGGGTTA CATCGAACTG GATCTCAACA GCGGTAAGAT 4950 
CCTTGAGAGT TTTCGCCCCG AAGAACGTTT TCCAATGATG AGCACTTTTA 5000 
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AAGTTCTGCT ATGTGGCGCG GTATTATCCC GTGATGACGC CGGGCAAGAG 5050 
CAACTCGGTC GCCGCATACA CTATTCTCAG AATGACTTGG TTGAGTACTC 5100 
ACCAGTCACA GAAAAGCATC TTACGGATGG CATGACAGTA AGAGAATTAT 5150 
GCAGTGCTGC CATAACCATG AGTGATAACA CTGCGGCCAA CTTACTTCTG 5200 
5 ACAACGATCG GAGGACCGAA GGAGCTAACC GCTTTTTTGC ACAACATGGG 5250 
GGATCATGTA ACTCGCCTTG ATCGTTGGGA ACCGGAGCTG AATGAAGCCA 5300 
TACCAAACGA CGAGCGTGAC ACCACGATGC CAGCAGCAAT GGCAACAACG 5350 
TTGCGCAAAC TATTAACTGG CGAACTACTT ACTCTAGCTT CCCGGCAACA 5400 
ATTAATAGAC TGGATGGAGG CGGATAAAGT TGCAGGACCA CTTCTGCGCT 5450 
10 CGGCCCTTCC GGCTGGCTGG TTTATTGCTG ATAAATCTGG AGCCGGTGAG 5500 
CGTGGGTCTC GCGGTATCAT TGCAGCACTG GGGCCAGATG GTAAGCCCTC 5550 
CCGTATCGTA GTTATCTACA CGACGGGGAG TCAGGCAACT ATGGATGAAC 5600 
GAAATAGACA GATCGCTGAG ATAGGTGCCT CACTGATTAA GCATTGGTAA 5650 
CTGTCAGACC AAGTTTACTC ATATATACTT TAGATTGATT TAAAACTTCA 5700 
15 TTTTTAATTT AAAAGGATCT AGGTGAAGAT CCTTTTTGAT AATCTCATGA 5750 
CCAAAATCCC TTAACGTGAG TTTTCGTTCC ACTGAGCGTC AGACCCCGTA 5800 
GAAAAGATCA AAGGATCTTC TTGAGATCCT TTTTTTCTGC GCGTAATCTG 5850 
CTGCTTGCAA ACAAAAAAAC CACCGCTACC AGCGGTGGTT TGTTTGCCGG 5900 
ATCAAGAGCT ACCAACTCTT TTTCCGAAGG TAACTGGCTT CAGCAGAGCG 5950 
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CAGATACCAA ATACTGTCCT TCTAGTGTAG CCGTAGTTAG GCCACCACTT 6000 
CAAGAACTCT GTAGCACCGC CTACATACCT CGCTCTGCTA ATCCTGTTAC 6050 
CAGTGGCTGC TGCCAGTGGC GATAAGTCGT GTCTTACCGG GTTGGACTCA 6100 
AGACGATAGT TACCGGATAA GGCGCAGCGG TCGGGCTGAA CCGGGGGTTC 6150 
5 GTGCACACAG CCCAGCTTGG AGCGAACGAC CTACACCGAA CTGAGATACC 6200 
TACAGCGTGA GCATTGAGAA AGCGCCACGC TTCCCGAAGG GAGAAAGGCG 6250 
GACAGGTATC CGGTAAGCGG CAGGGTCGGA ACAGGAGAGC GCACGAGGGA 6300 
GCTTCCAGGG GGAAACGCCT GGTATCTTTA TAGTCCTGTC GGGTTTCGCC 6350 
ACCTCTGACT TGAGCGTCGA TTTTTGTGAT GCTCGTCAGG GGGGCGGAGC 6400 

10 CTATGGAAAA ACGCCAGCAA CGCGGCCTTT TTACGGTTCC TGGCCTTTTG 6450 
CTGGCCTTTT GCTCACATGT TCTTTCCTGC GTTATCCCCT GATTCTGTGG 6500 
ATAACCGTAT TACCGCCTTT GAGTGAGCTG ATACCGCTCG CCGCAGCCGA 6550 
ACGACCGAGC GCAGCGAGTC AGTGAGCGAG GAAGCGGAAG AGCGCCCAAT 6600 
ACGCAAACCG CCTCTCCCCG CGCGTTGGCC GATTCATTAA TCCAGCTGGC 6650 

15 ACGACAGGTT TCCCGACTGG AAAGCGGGCA GTGAGCGCAA CGCAATTAAT 6700 
GTGAGTTACC TCACTCATTA GGCACCCCAG GCTTTACACT TTATGCTTCC 6750 
GGCTCGTATG TTGTGTGGAA TTGTGAGCGG ATAACAATTT CACACAGGAA 6800 
ACAGCTATGA CCATGATTAC GAATTAA 6827 

(2) INFORMATION FOR SEQ ID NO: 18: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 348 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 
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5 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:18: 

Glu Lys Ser Pro Glu Gin Arg Gly Arg Phe Arg Ala Met Val Glu 
15 10 15 

Leu Ala Arg Leu Asp Arg Arg Arg Pro Gly Ser Ser Asp Arg Val 
20 25 30 

10 Leu Phe Ala Arg Phe Ser Lys Thr Glu Gly Gly Ala Arg Arg Ala 

35 40 45 

Ser Pro Asp Gin Glu Ala Glu Asp Leu Trp Leu Ser Pro Leu Thr 
50 55 60 

Met Glu Asp Leu Val Cys Tyr Ser Phe Gin Val Ala Arg Gly Met 
15 65 70 ~ 75 

Glu Phe Leu Ala Ser Arg Lys Cys He His Arg Asp Leu Ala Ala 
80 85 90 

Arg Asn He Leu Leu Ser Glu Ser Asp Val Val Lys He Cys Asp 
95 100 105 

20 Phe Gly Leu Ala Arg Asp He Tyr Lys Asp Pro Asp Tyr Val Arg 

110 115 120 

Lys Gly Ser Ala Arg Leu Pro Leu Lys Trp Met Ala Pro Glu Ser 
125 130 135 

He Phe Asp Lys Val Tyr Thr Thr Gin Ser Asp Val Trp Ser Phe 
25 140 145 150 

Gly Val Leu Leu Trp Glu He Phe Ser Leu Gly Ala Ser Pro Tyr 
155 160 165 

Pro Gly Val Gin He Asn Glu Glu Phe Cys Gin Arg Leu Arg Asp 
170 175 180 

30 Gly Thr Arg Met Arg Ala Pro Glu Leu Ala Thr Pro Ala He Arg 

185 190 195 

Arg He Met Leu Asn Cys Trp Ser Gly Asp Pro Lys Ala Arg Pro 
200 205 210 

Ala Phe Ser Glu Leu Val Glu He Leu Gly Asp Leu Leu Gin Gly 
35 215 220 225 

Arg Gly Leu Gin Glu Glu Glu Glu Val Cys Met Ala Pro Arg Ser 
230 235 240 

Ser Gin Ser Ser Glu Glu Gly Ser Phe Ser Gin Val Ser Thr Met 
245 250 255 
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Ala Leu His lie Ala Gin Ala Asp Ala Glu Asp Ser Pro Pro Ser 
260 265 270 



Leu Gin Arg His Ser Leu Ala Ala Arg Tyr Tyr Asn Trp Val Ser 
275 280 285 

5 Phe Pro Gly Cys Leu Ala Arg Gly Ala Glu Thr Arg Gly Ser Ser 
290 295 300 

Arg Met Lys Thr Phe Glu Glu Phe Pro Met Thr Pro Thr Thr Tyr 
305 310 315 

Lys Gly Ser Val Asp Asn Gin Thr Asp Ser Gly Met Val Leu Ala 
10 320 325 330 

Ser Glu Glu Cys Glu Gin lie Glu. Ser Arg Tyr Arg Gin Glu Ser 
335 340 345 

Gly Phe Arg 
348 

15 (2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7607 bases 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
20 (D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 



TTCGAGCTCG CCCGACATTG ATTATTGACT AGTTATTAAT AGTAATCAAT 50 



TACGGGGTCA TTAGTTCATA GCCCATATAT GGAGTTCCGC GTTACATAAC 100 



TTACGGTAAA TGGCCCGCCT GGCTGACCGC CCAACGACCC CCGCCCATTG 150 



25 ACGTCAATAA TGACGTATGT TCCCATAGTA ACGCCAATAG GGACTTTCCA 200 



TTGACGTCAA TGGGTGGAGT ATTTACGGTA AACTGCCCAC TTGGCAGTAC 250 



ATCAAGTGTA TCATATGCCA AGTACGCCCC CTATTGACGT CAATGACGGT 300 



AAATGGCCCG CCTGGCATTA TGCCCAGTAC ATGACCTTAT GGGACTTTCC 350 



TACTTGGCAG TACATCTACG TATTAGTCAT CGCTATTACC ATGGTGATGC 400 



30 GGTTTTGGCA GTACATCAAT GGGCGTGGAT AGCGGTTTGA CTCACGGGGA 450 
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TTTCCAAGTC TCCACCCCAT TGACGTCAAT GGGAGTTTGT TTTGGCACCA 500 

AAATCAACGG GACTTTCCAA AATGTCGTAA CAACTCCGCC CCATTGACGC 550 
AAATGGGCGG TAGGCGTGTA CGGTGGGAGG TCTATATAAG CAGAGCTCGT 600 
TTAGTGAACC GTCAGATCGC CTGGAGACGC CATCCACGCT GTTTTGACCT 650 
5 CCATAGAAGA CACCGGGACC GATCCAGCCT CCGCGGCCGG GAACGGTGCA 700 
TTGGAACGCG GATTCCCCGT GCCAAGAGTG ACGTAAGTAC CGCCTATAGA 750 
GTCTATAGGC CCACTTGGCT TCGTTAGAAC GCGGCTACAA TTAATACATA 800 
ACCTTATGTA TCATACACAT ACGATTTAGG TGACACTATA GAATAACATC 850 
CACTTTGCCT TTCTCTCCAC AGGTGTCCAC TCCCAGGTCC AACTGCACCT 900 
10 CGGTTCTATC GATTGAATTC CCCGGGGATC CTCTAGAGAT CCCTCGACCT 950 
CGAGTCGACT TTTTTTTTTT TTTTTGTAGG CCAAAGGGTA CTTCTTTTTC 1000 
TTTATTAATT ACTCAGAAGT CTAGGCCACA GCAATCTACT GTTCTCCTCT 1050 
CATTTTCCTA AACTATTTTG ATACCTATTT CTCAGACTTT ATGGGCTATT 1100 
AGACATTTCT CACATTTCCA TAGATAATAA CTCATCCGTT TTGCAACCTG 1150 
15 ATTCTCAATA TTAAGAGATT AAAACTAATG TATATGACTC TCAGTTGACA 1200 
CATACTGAAG TACAGAAAAA TTCCATCATT TCCTTCTGCA AAATGAAAAA 1250 
GACTTCGTTT TCTCAACAGC TGCATCATTT TTTTATGCAT AGAAAAAAAT 1300 
GTGCAATTAC TCCAAGTACA ATCAAGTCAT TTAACATGGC TTTACCATCA 1350 
TTGTAGTTAC AGGATATTTT AAAAGAGAAA AAAAAATCTC AAAGCACAGG 1400 
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TCCTGCTGTG CAGCAAAGCA ATCAAATTCC TTCATAATAA CA6CCT6ATG 1450 
GGATTCAGCA ATCTGAGGAA TAATGAATAA CCACTCTAAT CAGTAAACAG 1500 
GAAAATGCTA CAACAGTCAC TGAGTAAAAA TTGGACTATC ATCTGTTGAT 1550 
TCTCTTGATC GACATTTCAA ACAATAAATG GAAATGTAAG TATCTCTTAA 1600 
5 AAAGAAAAAT AACTTGGTTT AGTGTGCTTA ATTTTACCAG GCAGTGAGGA 1650 
AATTATATAT CACCTTGACT GTCCTGCAGT GTTGCCCAGT CAATAAAATG 1700 
CACAAATAAT CTTTTTCATA ATACATGGCC AACTTTATCC TATCACTTGA 1750 
ATATGTCAGG ATAAACTGAT TGTGCAGTTG GTTGATAACA TTGTATTTTG 1800 
GAATGGATTA TTTGAATTTG TTTTGCTACT TTATTATTTG ATATTCTTCT 1850 
10 CCAGTGTTCA TCTTATGAAG TTATTTGCAT CTGAATATGA AGAGTCTGTT 1900 
TCAAAATAGT CTTCAAGTTT CCAACGCAGT GTCTCAAATG TAGGTCGTTC 1950 
CTTAGGCTCT GCATTCCAGC ACTCCAACAT GATGTTGTAA AATTGCTGTG 2000 
GACAGTTGGA TGGTTGCGGA AGTCTATAGT TTTGAGCCAA CATCTGGATT 2050 
ACCTGGGCAC CTGTCATACC ACTGTAAGGC ATTTTGCCAT AAGTAATGAT 2100 
15 TTCATAAAGA AGGATTCCAA ATGACCATAC ATCGGACTTA ATGCTGAATT 2150 
TATTACTACG AATGGCTTCG GGCGCAGTCC ACTTCACCGG CAGCTTTATT 2200 
TCGTGTCTAG ATTCATAGAT GTCTTCATTA TCTACCTTAA AAACTCTGGC 2250 
AAGTCCAAAA TCTGCTACTT TGTAGATATT ATGTTCACCA ACGAGGACAT 2300 
TTCTGGCAGC CAGATCTCTG TGAATGTAGT TCCGAGACTC CAGATAGGCC 2350 
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ATTCCAGAGG CAACCTGTGC CGCCATGTCT ACCTGTTGAG TCAGATGGAT 2400 

TTTTGATCCA GTGTCATTTT GGAGATATTC TTGCAGACTT CCATGTCTCA 2450 

TCAACTCTGT AATAATATAA ATTGGATCTT CTAAAGTGCA AACAGCATAA 2500 

AGCTGGATAA GCTTTGGATG TCTTAGGTTC TTCATTATCT GTGCCTCCCT 2550 

5 CAGGAAGTCA TTTGGATCCA TTGAACCTGG TTTTAATGTT TTCACTGCTA 2600 

CTGGAGTGGT ATTGTTCCAC AGACCTTCCC ATACTTCGCC AAACTGACCA 2650 

GATCCCAATC GCTTCAGAAG CTGTATGGAG TTGCGGTCTA TCTCCCATTG 2700 

GTCCACGGTT TTATACGACA AATCAAATGG AGCTGGGACC TGGATCTTTA 2750 

AGCATGGTTT CCCCAGCTTG ACACACAGGC CGTCACTTGT CTTGGTGTAG 2800 

10 TGGCTCACAA ATTCGTTCAG TGTTGAAAAG ATTCTTCTTC GCGTGAGAAA 2850 
AAATCCCCCT TCATCCAGTC TTTTAATTCT GTAGTGTTTT ACAACTGCTC 2900 
CATCTAAAAC TGAAAGAGAG AATTCTCCTT TTTGGCTTTC ACTTTCTCTG 2950 
ATTAGAAAGG AACCGGTCTT GTTTTCTGAA TATAATAGTT GTTTCTCTGC 3000 
ATCTGATCTT CCGATTGCTC CAAAGAACCA CGGCTCTGCC TGTAGGCTTC 3050 

15 TGTCCTCAGC CACGTAGTTA GAAGGAATAT AGCCTTGTAG TTGCTGACTG 3100 
GAGCCATCTC GTCTTTTCTC CAAGTGTCTG GCAAACCACC AGCCCTCATG 3150 
CAAAGTGTCC AGAACTTGAA GTTTGTCACC TGCTCGGAAG CTCAAGTCCT 3200 
CAGCAGTCCG AGCCTGGTAA TCAAACAAAG CCACAAAGTA GTGGCCATGC 3250 
CTCTGTGACT GGGGAGAGCA AAGGGCCCCT GGATTTTCAA TCACGGTTGA 3300 
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CTTGTCTGCC TCCGTGGACA AACAGGGGAG ATAGGGTTCT AGGTACTCCC 3350 
AGAGCCTCTG ACAGATGTTG CTCATTGTGC CTTGGTGGGG AGAAGAGGAG 3400 
CAGGGCTTCT CCCTCTCCCC TTAGTCTCTG CGATCCACCT TATCTTCCTT 3450 
CACCAGGCAA CTTTGAAGTC AGCACCAACT CACCATACTT CGGAGAGTAT 3500 
5 GCAAAGTCCC GTTTCAGATC AGTCCAGCAG CTGGGTTGCA GCAAGTCCTA 3550 
CCTGGAGAGA CTTACCGGCT TGCTTTCTGT GGCTGGAGGT GCTACCCCGA 3600 
GGCAAAACTG AGCAGGAGCT GGGCAGCTGC TCACTAGGAA GGTGTCTTTT 3650 
CTTCTTATCT GCTTAAGAAT CCCACAACAA AAATAAAATA AAATTAAAAG 3700 
GGCTTTATTT AGACAAATAT CTGAGAACAG AATGGTGCCA TCTTGCCTTT 3750 
10 TGTCCCAATA AAAAGTTAGC AAGAGGAAGC TACTAACCCC TGGTAAAACC 3800 
TCCACGTCTT GCTTTCGCCA GGGTCGACTC GAGGGATCTT CCATACCTAC 3850 
CAGTTCTGCG CCTGCAGGTC GCGGCCGCGA CTCTAGAGTC GACCTGCAGA 3900 
AGCTTGGCCG CCATGGCCCA ACTTGTTTAT TGCAGCTTAT AATGGTTACA 3950 
AATAAAGCAA TAGCATCACA AATTTCACAA ATAAAGCATT TTTTTCACTG 4000 
15 CATTCTAGTT GTGGTTTGTC CAAACTCATC AATGTATCTT ATCATGTCTG 4050 
GATCGGGAAT TAATTCGGCG CAGCACCATG GCCTGAAATA ACCTCTGAAA 4100 
GAGGAACTTG GTTAGGTACC TTCTGAGGCG GAAAGAACCA GCTGTGGAAT 4150 
GTGTGTCAGT TAGGGTGTGG AAAGTCCCCA GGCTCCCCAG CAGGCAGAAG 4200 
TATGCAAAGC ATGCATCTCA ATTAGTCAGC AACCAGGTGT GGAAAGTCCC 4250 
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CAGGCTCCCC AGCAGGCAGA AGTATGCAAA GCATGCATCT CAATTAGTCA 4300 
GCAACCATAG TCCCGCCCCT AACTCCGCCC ATCCCGCCCC TAACTCCGCC 4350 
CAGTTCCGCC CATTCTCCGC CCCATGGCTG ACTAATTTTT TTTATTTATG 4400 
CAGAGGCCGA GGCCGCCTCG GCCTCTGAGC TATTCCAGAA GTAGTGAGGA 4450 
5 GGCTTTTTTG GAGGCCTAGG CTTTTGCAAA AAGCTGTTAA CAGCTTGGCA 4500 
CTGGCCGTCG TTTTACAACG TCGTGACTGG GAAAACCCTG GCGTTACCCA 4550 
ACTTAATCGC CTTGCAGCAC ATCCCCCTTT CGCCAGCTGG CGTAATAGCG 4600 
AAGAGGCCCG CACCGATCGC CCTTCCCAAC AGTTGCGCAG CCTGAATGGC 4650 
GAATGGCGCC TGATGCGGTA TTTTCTCCTT ACGCATCTGT GCGGTATTTC 4700 
10 ACACCGCATA CGTCAAAGCA ACCATAGTAC GCGCCCTGTA GCGGCGCATT 4750 
AAGCGCGGCG GGTGTGGTGG TTACGCGCAG CGTGACCGCT ACACTTGCCA 4800 
GCGCCCTAGC GCCCGCTCCT TTCGCTTTCT TCCCTTCCTT TCTCGCCACG 4850 
TTCGCCGGCT TTCCCCGTCA AGCTCTAAAT CGGGGGCTCC CTTTAGGGTT 4900 
GCGATTTAGT GCTTTACGGC ACCTCGACCC CAAAAAACTT GATTTGGGTG 4950 
15 ATGGTTCACG TAGTGGGCCA TCGCCCTGAT AGACGGTTTT TCGCCCTTTG 5000 
ACGTTGGAGT CCACGTTCTT TAATAGTGGA CTCTTGTTCC AAACTGGAAC 5050 
AACACTCAAC CCTATCTCGG GCTATTCTTT TGATTTATAA GGGATTTTGC 5100 
CGATTTCGGC CTATTGGTTA AAAAATGAGC TGATTTAACA AAAATTTAAC 5150 
GCGAATTTTA ACAAAATATT AACGTTTACA ATTTTATGGT GCACTCTCAG 5200 
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TACAATCTGC TCTGATGCCG CATAGTTAAG CCAGCCCCGA CACCCGCCAA 5250 
CACCCGCTGA CGCGCCCTGA CGGGCTTGTC TGCTCCCGGC ATCCGCTTAC 5300 
AGACAAGCTG TGACCGTCTC CGGGAGCTGC ATGTGTCAGA GGTTTTCACC 5350 
GTCATCACCG AAACGCGCGA GACGAAAGGG CCTCGTGATA CGCCTATTTT 5400 
5 TATAGGTTAA TGTCATGATA ATAATGGTTT CTTAGACGTC AGGTGGCACT 5450 
TTTCGGGGAA ATGTGCGCGG AACCCCTATT TGTTTATTTT TCTAAATACA 5500 
TTCAAATATG TATCCGCTCA TGAGACAATA ACCCTGATAA ATGCTTCAAT 5550 
AATATTGAAA AAGGAAGAGT ATGAGTATTC AACATTTCCG TGTCGCCCTT 5600 
ATTCCCTTTT TTGCGGCATT TTGCCTTCCT GTTTTTGCTC ACCCAGAAAC 5650 
10 GCTGGTGAAA GTAAAAGATG CTGAAGATCA GTTGGGTGCA CGAGTGGGTT 5700 
ACATCGAACT GGATCTCAAC AGCGGTAAGA TCCTTGAGAG TTTTCGCCCC 5750 
GAAGAACGTT TTCCAATGAT GAGCACTTTT AAAGTTCTGC TATGTGGCGC 5800 
GGTATTATCC CGTATTGACG CCGGGCAAGA GCAACTCGGT CGCCGCATAC 5850 
ACTATTCTCA GAATGACTTG GTTGAGTACT CACCAGTCAC AGAAAAGCAT 5900 
15 CTTACGGATG GCATGACAGT AAGAGAATTA TGCAGTGCTG CCATAACCAT 5950 
GAGTGATAAC ACTGCGGCCA ACTTACTTCT GACAACGATC GGAGGACCGA 6000 
AGGAGCTAAC CGCTTTTTTG CACAACATGG GGGATCATGT AACTCGCCTT 6050 
GATCGTTGGG AACCGGAGCT GAATGAAGCC ATACCAAACG ACGAGCGTGA 6100 
CACCACGATG CCTGTAGCAA TGGCAACAAC GTTGCGCAAA CTATTAACTG 6150 
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GCGAACTACT TACTCTAGCT TCCCGGCAAC AATTAATAGA CTGGATGGAG 6200 

GCGGATAAAG TTGCAGGACC ACTTCTGCGC TCGGCCCTTC CGGCTGGCTG 6250 
GTTTATTGCT GATAAATCTG GAGCCGGTGA GCGTGGGTCT CGCGGTATCA 6300 
TTGCAGCACT GGGGCCAGAT GGTAAGCCCT CCCGTATCGT AGTTATCTAC 6350 
5 ACGACGGGGA GTCAGGCAAC TATGGATGAA CGAAATAGAC AGATCGCTGA 6400 
GATAGGTGCC TCACTGATTA AGCATTGGTA ACTGTCAGAC CAAGTTTACT 6450 
CATATATACT TTAGATTGAT TTAAAACTTC ATTTTTAATT TAAAAGGATC 6500 
TAGGTGAAGA TCCTTTTTGA TAATCTCATG ACCAAAATCC CTTAACGTGA 6550 
GTTTTCGTTC CACTGAGCGT CAGACCCCGT AGAAAAGATC AAAGGATCTT 6600 

10 CTTGAGATCC TTTTTTTCTG CGCGTAATCT GCTGCTTGCA AACAAAAAAA 6650 
CCACCGCTAC CAGCGGTGGT TTGTTTGCCG GATCAAGAGC TACCAACTCT 6700 
TTTTCCGAAG GTAACTGGCT TCAGCAGAGC GCAGATACCA AATACTGTTC 6750 
TTCTAGTGTA GCCGTAGTTA GGCCACCACT TCAAGAACTC TGTAGCACCG 6800 
CCTACATACC TCGCTCTGCT AATCCTGTTA CCAGTGGCTG CTGCCAGTGG 6850 

15 CGATAAGTCG TGTCTTACCG GGTTGGACTC AAGACGATAG TTACCGGATA 6900 
AGGCGCAGCG GTCGGGCTGA ACGGGGGGTT CGTGCACACA GCCCAGCTTG 6950 
GAGCGAACGA CCTACACCGA ACTGAGATAC CTACAGCGTG AGCTATGAGA 7000 
AAGCGCCACG CTTCCCGAAG GGAGAAAGGC GGACAGGTAT CCGGTAAGCG 7050 
GCAGGGTCGG AACAGGAGAG CGCACGAGGG AGCTTCCAGG GGGAAACGCC 7100 
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TGGTATCTTT ATAGTCCTGT CGGGTTTCGC CACCTCTGAC TTGAGCGTCG 7150 

ATTTTTGTGA TGCTCGTCAG GGGGGCGGAG CCTATGGAAA AACGCCAGCA 7200 

ACGCGGCCTT TTTACGGTTC CTGGCCTTTT GCTGGCCTTT TGCTCACATG 7250 

TTCTTTCCTG CGTTATCCCC TGATTCTGTG GATAACCGTA TTACCGCCTT 7300 

5 TGAGTGAGCT GATACCGCTC GCCGCAGCCG AACGACCGAG CGCAGCGAGT 7350 

CAGTGAGCGA GGAAGCGGAA GAGCGCCCAA TACGCAAACC GCCTCTCCCC 7400 

GCGCGTTGGC CGATTCATTA ATGCAGCTGG CACGACAGGT TTCCCGACTG 7450 

GAAAGCGGGC AGTGAGCGCA ACGCAATTAA TGTGAGTTAG CTCACTCATT 7500 

AGGCACCCCA GGCTTTACAC TTTATGCTTC CGGCTCGTAT GTTGTGTGGA 7550 

10 ATTGTGAGCG GATAACAATT TCACACAGGA AACAGCTATG ACATGATTAC 7600 

GAATTAA 7607 

(2) INFORMATION FOR SEQ ID NO:20: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 505 amino acids 
15 (B) TYPE: amino acid 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 

Met Ser Asn lie Cys Gin Arg Leu Trp Glu Tyr Leu Glu Pro Tyr 
15 10 15 

20 Leu Pro Cys Leu Ser Thr Glu Ala Asp Lys Ser Thr Val lie Glu 

20 25 30 

Asn Pro Gly Ala Leu Cys Ser Pro Gin Ser Gin Arg His Gly His 
35 40 45 

Tyr Phe Val Ala Leu Phe Asp Tyr Gin Ala Arg Thr Ala Glu Asp 
25 50 55 60 

Leu Ser Phe Arg Ala Gly Asp Lys Leu Gin Val Leu Asp Thr Leu 
65 70 75 
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His Glu Gly Trp Trp Phe Ala Arg His Leu Glu Lys Arg Arg Asp 
80 85 90 

Gly Ser Ser Gin Gin Leu Gin Gly Tyr lie Pro Ser Asn Tyr Val 
95 100 105 

5 Ala Glu Asp Arg Ser Leu Gin Ala Glu Pro Trp Phe Phe Gly Ala 
110 115 120 

He Gly Arg Ser Asp Ala Glu Lys Gin Leu Leu Tyr Ser Glu Asn 
125 130 135 

Lys Thr Gly Ser Phe Leu He Arg Glu Ser Glu Ser Gin Lys Gly 
10 140 145 150 

Glu Phe Ser Leu Ser Val Leu Asp Gly Ala Val Val Lys His Tyr 
155 160 165 

Arg He Lys Arg Leu Asp Glu Gly Gly Phe Phe Leu Thr Arg Arg 
170 175 180 

15 Arg He Phe Ser Thr Leu Asn Glu Phe Val Ser His Tyr Thr Lys 

185 190 195 

Thr Ser Asp Gly Leu Cys Val Lys Leu Gly Lys Pro Cys Leu Lys 
200 205 210 

He Gin Val Pro Ala Pro Phe Asp Leu Ser Tyr Lys Thr Val Asp 
20 215 220 225 

Gin Trp Glu He Asp Arg Asn Ser He Gin Leu Leu Lys Arg Leu 
230 235 240 

Gly Ser Gly Gin Phe Gly Glu Val Trp Glu Gly Leu Trp Asn Asn 
245 250 255 

25 Thr Thr Pro Val Ala Val Lys Thr Leu Lys Pro Gly Ser Met Asp 

260 265 270 

Pro Asn Asp Phe Leu Arg Glu Ala Gin He Met Lys Asn Leu Arg 
275 280 285 

His Pro Lys Leu He Gin Leu Tyr Ala Val Cys Thr Leu Glu Asp 
30 290 295 300 

Pro He Tyr He He Thr Glu Leu Met Arg His Gly Ser Leu Gin 
305 310 315 

Glu Tyr Leu Gin Asn Asp Thr Gly Ser Lys He His Leu Thr Gin 
320 325 330 

35 Gin Val Asp Met Ala Ala Gin Val Ala Ser Gly Met Ala Tyr Leu 

335 340 345 

Glu Ser Arg Asn Tyr He His Arg Asp Leu Ala Ala Arg Asn Val 
350 355 360 
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Leu Val Gly Glu His Asn lie Tyr Lys Val Ala Asp Phe Gly Leu 

365 370 375 

Ala Arg Val Phe Lys Val Asp Asn Glu Asp lie Tyr Glu Ser Arg 

380 385 390 

5 His Glu lie Lys Leu Pro Val Lys Trp Thr Ala Pro Glu Ala He 

395 400 405 

Arg Ser Asn Lys Phe Ser He Lys Ser Asp Val Trp Ser Phe Gly 

410 415 420 

He Leu Leu Tyr Glu He He Thr Tyr Gly Lys Met Pro Tyr Ser 

10 425 430 435 

Gly Met Thr Gly Ala Gin Val He Gin Met Leu Ala Gin Asn Tyr 

440 445 450 

Arg Leu Pro Gin Pro Ser Asn Cys Pro Gin Gin Phe Tyr Asn He 

455 460 465 

15 Met Leu Glu Cys Trp Asn Ala Glu Pro Lys Glu Arg Pro Thr Phe 

470 475 480 

Glu Thr Leu Arg Trp Lys Leu Glu Asp Tyr Phe Glu Thr Asp Ser 

485 490 495 

Ser Tyr Ser Asp Ala Asn Asn Phe He Arg 
20 500 505 

(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 404 bases 

(B) TYPE: nucleic acid 
25 (C) STRAND EDNESS : single 

(D> TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 



GCGGCCGCAG AGAAAGCAGA GGATGGGGCT TAGCAGCTGG CAGAGCCAGG 50 



AGCGGGGAGG TAGCAGAAAG ACCACAAGTA CAAAGAAGTC CTGAAACTTT 100 



30 GGTTTTGCTG CTGCAGCCCA TTGAGAGTGA CGACATGGAG CACAAGACCC 150 



TGAAGATCAC CGACTTTGGC CTGGCCCGAG AGTGGCACAA AACCACACAA 200 



ATGAGTGCCG CNGGCACCTA CNCCTGGATG GCTCCTGAGG TTATCAAGGC 250 



CTCCACCTTC TCTAAGGGCA GTGACGTCTG GAGTTTTGGG GTGCTGCTGT 300 
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GGGAACTGCT GACCGGGGAG NTGCCATACC GTGGCATTGA CTGCCTTGCT 350 
GTGGCCTATG GCGTAGCTGT TAACAAGCTC ACACTGCCAT CCATCCACCT 400 
GGCC 404 

(2) INFORMATION FOR SEQ ID NO: 22: 

5 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3120 bases 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

10 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22: 

ATGAGAGCGT TGGCGCGCGA CGGCGGCCAG CTGCCGCTGC TCGTTGTTTT 50 
TTCTGCAATG ATATTTGGGA CTATTACAAA TCAAGATCTG CCTGTGATCA 100 
AGTGTGTTTT AATCAATCAT AAGAACAATG ATTCATCAGT GGGGAAGTCA 150 
TCATCATATC CCATGGTATC AGAATCCCCG GAAGACCTCG GGTGTGCGTT 200 

15 GAGACCCCAG AGCTCAGGGA CAGTGTACGA AGCTGCCGCT GTGGAAGTGG 250 
ATGTATCTGC TTCCATCACA CTGCAAGTGC TGGTCGATGC CCCAGGGAAC 300 
ATTTCCTGTC TCTGGGTCTT TAAGCACAGC TCCCTGAATT GCCAGCCACA 350 
TTTTGATTTA CAAAACAGAG GAGTTGTTTC CATGGTCATT TTGAAAATGA 400 
CAGAAACCCA AGCTGGAGAA TACCTACTTT TTATTCAGAG TGAAGCTACC 450 

20 AATTACACAA TATTGTTTAC AGTGAGTATA AGAAATACCC TGCTTTACAC 500 
ATTAAGAAGA CCTTACTTTA GAAAAATGGA AAACCAGGAC GCCCTGGTCT 550 
GCATATCTGA GAGCGTTCCA GAGCGGATCC TGGAATGGGT GCTTTGCGAT 600 
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TCACAGGGGG AAAGCTGTAA AGAAGAAAGT CCAGCTGTTG TTAAAAAGGA 650 

GGAAAAAGTG CTTCATGAAT TATTTGGGAC GGACATAAGG TGCTGTGCCA 700 
GAAATGAACT GGGCAGGGAA TGCACCAGGC TGTTCACAAT AGATCTAAAT 750 
CAAACTCCTC AGACCACATT GCCACAATTA TTTCTTAAAG TAGGGGAACC 800 



5 CTTATGGATA AGGTGCAAAG CTGTTCATGT GAACCATGGA TTCGGGCTCA 850 
CCTGGGAATT AGAAAACAAA GCACTCGAGG AGGGCAACTA CTTTGAGATG 900 
AGTACCTATT CAACAAACAG AACTATGATA CGGATTCTGT TTGCTTTTGT 950 
ATCATCAGTG GCAAGAAACG ACACCGGATA CTACACTTGT TCCTCTTCAA 1000 
AGCATCCCAG TCAATCAGCT TTGGTTACCA TCGTAGAAAA GGGATTTATA 1050 

10 AATGCTACCA ATTCAAGTGA AGATTATGAA ATTGACCAAT ATGAAGAGTT 1100 
TTGTTTTTCT GTCAGGTTTA AAGCCTACCC ACAAATCAGA TGTACGTGGA 1150 
CCTTCTCTCG AAAATCATTT CCTTGTGAGC AAAAGGGTCT TGATAACGGA 1200 
TACAGCATAT CCAAGTTTTG CAATCATAAG CACCAGCCAG GAGAATATAT 1250 
ATTCCATGCA GAAAATGATG ATGCCCAATT TACCAAAATG TTCACGCTGT 1300 

15 ATATAAGAAG GAAACCTCAA GTCCTCGCAG AAGCTTCGGC AAGTCAGGCG 1350 
TCCTGTTTCT CGGATGGATA CCCATTACCA TCTTGGACCT GGAAGAAGTG 1400 
TTCAGACAAG TCTCCCAACT GCACAGAAGA GATCACAGAA GGAGTCTGGA 1450 
ATAGAAAGGC TAACAGAAAA GTGTTTGGAC AGTGGGTGTC GAGCAGTACT 1500 
CTAAACATGA GTGAAGCCAT AAAAGGGTTC CTGGTCAAGT GCTGTGCATA 1550 
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CAATTCCCTT GGCACATCTT GTGAGACGAT CCTTTTAAAC TCTCCAGGCC 1600 

CCTTCCCTTT CATCCAAGAC AACATCTCAT TCTATGCAAC AATTGGTGTT 1650 

TGTCTCCTCT TCATTGTCGT TTTAACCCTG CTAATTTGTC ACAAGTACAA 1700 

AAAGCAATTT AGGTATGAAA GCCAGCTACA GATGGTACAG GTGACCGGAT 1750 

5 CCTCAGATTA TGAGTACTTC TACGTTGATT TCAGAGAATA TGAATATGAT 1800 

GTCAAATGGG AGTTTCCAAG AGAAAATTTA GAGTTTGGGA AGGTACTAGG 1850 

ATCAGGTGCT TTTGGAAAAG TGATGAACGC AACAGCTTAT GGAATTAGCA 1900 

AAACAGGAGT CTCAATCCAG GTTACCGTCA AAATGCTGAA AGAAAAAGCA 1950 

GACAGCTCTG AAAGAGAGGC ACTCATGTCA GAACTCAAGA TGATGACCCA 2000 

10 GCTGGGAAGC CACGAGAATA TTGTGAACCT GCTGGGGGCG TGCACACTGT 2050 
CAGGACCAAT TTACTTGATT TTTGAATACT GTTGCTATGG TGATCTTCTC 2100 
AACTATCTAA GAAGTAAAAG AGAAAAATTT CACAGGACTT GGACAGAGAT 2150 
TTTCAAGGAA CACAATTTCA GTTTTTACCC CACTTTCCAA TCACATCCAA 2200 
ATTCCAGCAT GCCTGGTTCA AGAGAAGTTC AGATACACCC GGACTCGGAT 2250 

15 CAAATCTCAG GGCTTCATGG GAATTCATTT CACTCTGAAG ATGAAATTGA 2300 
ATATGAAAAC CAAAAAAGGC TGGAAGAAGA GGAGGACTTG AATGTGCTTA 2350 
CATTTGAAGA TCTTCTTTGC TTTGCATATC AAGTTGCCAA AGGAATGGAA 2400 
TTTCTGGAAT TTAAGTCGTG TGTTCACAGA GACCTGGCCG CCAGGAACGT 2450 
GCTTGTCACC CACGGGAAAG TGGTGAAGAT ATGTGACTTT GGATTGGCTC 2500 
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GAGATATCAT GAGTGATTCC AACTATGTTG TCAGGGGCAA TGCCCGTCTG 2550 

CCTGTAAAAT GGATGGCCCC CGAAAGCCTG TTTGAAGGCA TCTACACCAT 2600 

TAAGAGTGAT GTCTGGTCAT ATGGAATATT ACTGTGGGAA ATCTTCTCAC 2650 

TTGGTGTGAA TCCTTACCCT GGCATTCCGG TTGATGCTAA CTTCTACAAA 2700 

5 CTGATTCAAA ATGGATTTAA AATGGATCAG CCATTTTATG CTACAGAAGA 2750 

AATATACATT ATAATGCAAT CCTGCTGGGC TTTTGACTCA AGGAAACGGC 2800 

CATCCTTCCC TAATTTGACT TCGTTTTTAG GATGTCAGCT GGCAGATGCA 2850 

GAAGAAGCGA TGTATCAGAA TGTGGATGGC CGTGTTTCGG AATGTCCTCA 2900 

CACCTACCAA AACAGGCGAC CTTTCAGCAG AGAGATGGAT TTGGGGCTAC 2950 

10 TCTCTCCGCA GGCTCAGGTC GAAGATTCGT AGAGGAACAA TTTAGTTTTA 3000 

AGGACTTCAT CCCTCCACCT ATCCCTAACA GGCTGTAGAT TACCAAAACA 3050 

AGGTTAATTT CATCACTAAA AGAAAATCTA TTATCAACTG CTGCTTCACC 3100 

AGACTTTTCT CTAGAGAGCG 3120 

(2) INFORMATION FOR SEQ ID NO: 23: 

15 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3969 bases 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

20 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23: 

TCGGCGTCCA CCCGCCCAGG GAGAGTCAGA CCTGGGGGGG CGAGGGCCCC 50 
CCAAACTCAG TTCGGATCCT ACCCGAGTGA GGCGGCGCCA TGGAGCTCCG 100 
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GGTGCTGCTC TGCTGGGCTT CGTTGGCCGC AGCTTTGGAA GAGACCCTGC 150 
TGAACACAAA ATTGGAAACT GCTGATCTGA AGTGGGTGAC ATTCCCTCAG 200 
GTGGACGGGC AGTGGGAGGA ACTGAGCGGC CTGGATGAGG AACAGCACAG 250 
CGTGCGCACC TACGAAGTGT GTGACGTGCA GCGTGCCCCG GGCCAGGCCC 300 
5 ACTGGCTTCG CACAGGTTGG GTCCCACGGC GGGGCGCCGT CCACGTGTAC 350 
GCCACGCTGC GCTTCACCAT GCTCGAGTGC CTGTCCCTGC CTCGGGCTGG 400 
GCGCTCCTGC AAGGAGACCT TCACCGTCTT CTACTATGAG AGCGATGCGG 450 
ACACGGCCAC GGCCCTCACG CCAGCCTGGA TGGAGAACCC CTACATCAAG 500 
GTGGACACGG TGGCCGCGGA GCATCTCACC CGGAAGCGCC CTGGGGCCGA 550 

10 GGCCACCGGG AAGGTGAATG TCAAGACGCT GCGTCTGGGA CCGCTCAGCA 600 
AGGCTGGCTT CTACCTGGCC TTCCAGGACC AGGGTGCCTG CATGGCCCTG 650 
CTATCCCTGC ACCTCTTCTA CAAAAAGTGC GCCCAGCTGA CTGTGAACCT 700 
GACTCGATTC CCGGAGACTG TGCCTCGGGA GCTGGTTGTG CCCGTGGCCG 750 
GTAGCTGCGT GGTGGATGCC GTCCCCGCCC CTGGCCCCAG CCCCAGCCTC 800 

15 TACTGCCGTG AGGATGGCCA GTGGGCCGAA CAGCCGGTCA CGGGCTGCAG 850 
CTGTGCTCCG GGGTTCGAGG CAGCTGAGGG GAACACCAAG TGCCGAGCCT 900 
GTGCCCAGGG CACCTTCAAG CCCCTGTCAG GAGAAGGGTC CTGCCAGCCA 950 
TGCCCAGCCA ATAGCCACTC TAACACCATT GGATCAGCCG TCTGCCAGTG 1000 
CCGCGTCGGG TACTTCCGGG CACGCACAGA CCCCCGGGGT GCACCCTGCA 1050 
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CCACCCCTCC TTCGGCTCCG CGGAGCGTGG TTTCCCGCCT GAACGGCTCC 1100 

TCCCTGCACC TGGAATGGAG TGCCCCCCTG GAGTCTGGTG GCCGAGAGGA 1150 

CCTCACCTAC GCCCTCCGCT GCCGGGAGTG CCGACCCGGA GGCTCCTGTG 1200 

CGCCCTGCGG GGGAGACCTG ACTTTTGACC CCGGCCCCCG GGACCTGGTG 1250 

5 GAGCCCTGGG TGGTGGTTCG AGGGCTACGT CCTGACTTCA CCTATACCTT 1300 

TGAGGTCACT GCATTGAACG GGGTATCCTC CTTAGCCACG GGGCCCGTCC 1350 

CATTTGAGCC TGTCAATGTC ACCACTGACC GAGAGGTACC TCCTGCAGTG 1400 

TCTGACATCC GGGTGACGCG GTCCTCACCC AGCAGCTTGA GCCTGGCCTG 1450 

GGCTGTTCCC CGGGCACCCA GTGGGGCTGT GCTGGACTAC GAGGTCAAAT 1500 

10 ACCATGAGAA GGGCGCCGAG GGTCCCAGCA GCGTGCGGTT CCTGAAGACG 1550 
TCAGAAAACC GGGCAGAGCT GCGGGGGCTG AAGCGGGGAG CCAGCTACCT 1600 
GGTGCAGGTA CGGGCGCGCT CTGAGGCCGG CTACGGGCCC TTCGGCCAGG 1650 
AACATCACAG CCAGACCCAA CTGGATGAGA GCGAGGGCTG GCGGGAGCAG 1700 
CTGGCCCTGA TTGCGGGCAC GGCAGTCGTG GGTGTGGTCC TGGTCCTGGT 1750 

15 GGTCATTGTG GTCGCAGTTC TCTGCCTCAG GAAGCAGAGC AATGGGAGAG 1800 
AAGCAGAATA TTCGGACAAA CACGGACAGT ATCTCATCGG ACATGGTACT 1850 
AAGGTCTACA TCGACCCCTT CACTTATGAA GACCCTAATG AGGCTGTGAG 1900 
GGAATTTGCA AAAGAGATCG ATGTCTCCTA CGTCAAGATT GAAGAGGTGA 1950 
TTGGTGCAGG TGAGTTTGGC GAGGTGTGCC GGGGGCGGCT CAAGGCCCCA 2000 
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GGGAAGAAGG AGAGCTGTGT GGCAATCAAG ACCCTGAAGG GTGGCTACAC 2050 

GGAGCGGCAG CGGCGTGAGT TTCTGAGCGA GGCCTCCATC ATGGGCCAGT 2100 

TCGAGCACCC CAATATCATC CGCCTGGAGG GCGTGGTCAC CAACAGCATG 2150 

CCCGTCATGA TTCTCACAGA GTTCATGGAG AACGGCGCCC TGGACTCCTT 2200 

5 CCTGCGGCTA AACGACGGAC AGTTCACAGT CATCCAGCTC GTGGGCATGC 2250 

TGCGGGGCAT CGCCTCGGGC ATGCGGTACC TTGCCGAGAT GAGCTACGTC 2300 

CACCGAGACC TGGCTGCTCG CAACATCCTA GTCAACAGCA ACCTCGTCTG 2350 

CAAAGTGTCT GACTTTGGCC TTTCCCGATT CCTGGAGGAG AACTCTTCCG 2400 

ATCCCACCTA CACGAGCTCC CTGGGAGGAA AGATTCCCAT CCGATGGACT 2450 

10 GCCCCGGAGG CCATTGCCTT CCGGAAGTTC ACTTCCGCCA GTGATGCCTG 2500 

GAGTTACGGG ATTGTGATGT GGGAGGTGAT GTCATTTGGG GAGAGGCCGT 2550 

ACTGGGACAT GAGCAATCAG GACGTGATCA ATGCCATTGA ACAGGACTAC 2600 

CGGCTGCCCC CGCCCCCAGA CTGTCCCACC TCCCTCCACC AGCTCATGCT 2650 

GGACTGTTGG CAGAAAGACC GGAATGCCCG GCCCCGCTTC CCCCAGGTGG 2700 

15 TCAGCGCCCT GGACAAGATG ATCCGGAACC CCGCCAGCCT CAAAATCGTG 2750 

GCCCGGGAGA ATGGCGGGGC CTCACACCCT CTCCTGGACC AGCGGCAGCC 2800 

TCACTACTCA GCTTTTGGCT CTGTGGGCGA GTGGCTTCGG GCCATCAAAA 2850 

TGGGAAGATA CGAAGAAAGT TTCGCAGCCG CTGGCTTTGG CTCCTTCGAG 2900 

CTGGTCAGCC AGATCTCTGC TGAGGACCTG CTCCGAATCG GAGTCACTCT 2950 
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GGCGGGACAC CAGAAGAAAA TCTTGGCCAG TGTCCAGCAC ATGAAGTCCC 3000 

AGGCCAAGCC GGGAACCCCG GGTGGGACAG GAGGACCGGC CCCGCAGTAC 3050 
TGACCTGCAG GAACTCCCCA CCCCAGGGAC ACCGCCTCCC CATTTTCCGG 3100 
GGCAGAGTGG GGACTCACAG AGGCCCCCAG CCCTGTGCCC CGCTGGATTG 3150 
5 CACTTTGAGC CCGTGGGGTG AGGAGTTGGC AATTTGGAGA GACAGGATTT 3200 
GGGGGTTCTG CCATAATAGG AGGGGAAAAT CACCCCCCAG CCACCTCGGG 3250 
GAACTCCAGA CCAAGGGTGA GGGCGCCTTT CCCTCAGGAC TGGGTGTGAC 3300 
CAGAGGAAAA GGAAGTGCCC AACATCTCCC AGCCTCCCCA GGTGCCCCCC 3350 
TCACCTTGAT GGGTGCGTTC CCGCAGACCA AAGAGAGTGT GACTCCCTTG 3400 

10 CCAGCTCCAG AGTGGGGGGG CTGTCCCAGG GGGCAAGAAG GGGTGTCAGG 3450 
GCCCAGTGAC AAAATCATTG GGGTTTGTAG TCCCAACTTG CTGCTGTCAC 3500 
CACCAAACTC AATCATTTTT TTCCCTTGTA AATGCCCCTC CCCCAGCTGC 3550 
TGCCTTCATA TTGAAGGTTT TTGAGTTTTG TTTTTGGTCT TAATTTTTCT 3600 
CCCCGTTCCC TTTTTGTTTC TTCGTTTTGT TTTTCTACCG TCCTTGTCAT 3650 

15 AACTTTGTGT TGGAGGGAAC CTGTTTCACT ATGGCCTCCT TTGCCCAAGT 3700 
TGAAACAGGG GCCCATCATC ATGTCTGTTT CCAGAACAGT GCCTTGGTCA 3750 
TCCCACATCC CCGGACCCCG CCTGGGACCC CCAAGCTGTG TCCTATGAAG 3800 
GGGTGTGGGG TGAGGTAGTG AAAAGGGCGG TAGTTGGTGG TGGAACCCAG 3850 
AAACGGACGC CGGTGCTTGG AGGGGTTCTT AAATTATATT TAAAAAAGTA 3900 
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ACTTTTTGTA TAAATAAAAG AAAATGGGAC GTGTCCCAGC TCCAGGGGTA 3950 



AAAAAAAAAA AAAAAAAAA 3969 



(2) INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS: 
5 (A) LENGTH: 1276 amino acids 

(B) TYPE : amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24: 

Met Glu Leu Arg Val Leu Leu Cys Trp Ala Ser Leu Ala Ala Ala 
10 1 5 10 15 

Leu Glu Glu Thr Leu Leu Asn Thr Lys Leu Glu Thr Ala Asp Leu 
20 25 30 

Lys Trp Val Thr Phe Pro Gin Val Asp Gly Gin Trp Glu Glu Leu 
35 40 45 

15 Ser Gly Leu Asp Glu Glu Gin His Ser Val Arg Thr Tyr Glu Val 

50 55 60 

Cys Asp Val Gin Arg Ala Pro Gly Gin Ala His Trp Leu Arg Thr 
65 70 75 

Gly Trp Val Pro Arg Arg Gly Ala Val His Val Tyr Ala Thr Leu 
20 80 85 90 

Arg Phe Thr Met Leu Glu Cys Leu Ser Leu Pro Arg Ala Gly Arg 
95 100 105 

Ser Cys Lys Glu Thr Phe Thr Val Phe Tyr Tyr Glu Ser Asp Ala 
110 115 120 

25 Asp Thr Ala Thr Ala Leu Thr Pro Ala Trp Met Glu Asn Pro Tyr 

125 130 135 

lie Lys Val Asp Thr Val Ala Ala Glu His Leu Thr Arg Lys Arg 
140 145 150 

Pro Gly Ala Glu Ala Thr Gly Lys Val Asn Val Lys Thr Leu Arg 
30 155 160 165 

Leu Gly Pro Leu Ser Lys Ala Gly Phe Tyr Leu Ala Phe Gin Asp 
170 175 180 

Gin Gly Ala Cys Met Ala Leu Leu Ser Leu His Leu Phe Tyr Lys 
185 190 195 

35 Lys Cys Ala Gin Leu Thr Val Asn Leu Thr Arg Phe Pro Glu Thr 

200 205 210 
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Val Pro Arg Glu Leu Val Val Pro Val Ala Gly Ser Cys Val Val 
215 220 225 

Asp Ala Val Pro Ala Pro Gly Pro Ser Pro Ser Leu Tyr Cys Arg 
230 235 240 

5 Glu Asp Gly Gin Trp Ala Glu Gin Pro Val Thr Gly Cys Ser Cys 
245 250 255 

Ala Pro Gly Phe Glu Ala Ala Glu Gly Asn Thr Lys Cys Arg Ala 
260 265 270 

Cys Ala Gin Gly Thr Phe Lys Pro Leu Ser Gly Glu Gly Ser Cys 
10 275 280 285 

Gin Pro Cys Pro Ala Asn Ser His Ser Asn Thr lie Gly Ser Ala 
290 295 300 

Val Cys Gin Cys Arg Val Gly Tyr Phe Arg Ala Arg Thr Asp Pro 
305 310 315 

15 Arg Gly Ala Pro Cys Thr Thr Pro Pro Ser Ala Pro Arg Ser Val 

320 325 330 

Val Ser Arg Leu Asn Gly Ser Ser Leu His Leu Glu Trp Ser Ala 
335 340 345 

Pro Leu Glu Ser Gly Gly Arg Glu Asp Leu Thr Tyr Ma Leu Arg 
20 350 355 360 

Cys Arg Glu Cys Arg Pro Gly Gly Ser Cys Ala Pro Cys Gly Gly 
365 370 375 

Asp Leu Thr Phe Asp Pro Gly Pro Arg Asp Leu Val Glu Pro Trp 
380 385 390 

25 Val Val Val Arg Gly Leu Arg Pro Asp Phe Thr Tyr Thr Phe Glu 

395 400 405 

Val Thr Ala Leu Asn Gly Val Ser Ser Leu Ala Thr Gly Pro Val 
410 415 420 

Pro Phe Glu Pro Val Asn Val Thr Thr Asp Arg Glu Val Pro Pro 
30 425 430 435 

Ala Val Ser Asp lie Arg Val Thr Arg Ser Ser Pro Ser Ser Leu 
440 445 450 

Ser Leu Ala Trp Ala Val Pro Arg Ala Pro Ser Gly Ala Val Leu 
455 460 465 

35 Asp Tyr Glu Val Lys Tyr His Glu Lys Gly Ala Glu Gly Pro Ser 

470 475 480 

Ser Val Arg Phe Leu Lys Thr Ser Glu Asn Arg Ala Glu Leu Arg 
485 490 495 
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Gly Leu Lys Arg Gly Ala Ser Tyr Leu Val Gin Val Arg Ala Arg 
500 505 510 

Ser Glu Ala Gly Tyr Gly Pro Phe Gly Gin Glu His His Ser Gin 
515 520 525 

5 Thr Gin Leu Asp Glu Ser Glu Gly Trp Arg Glu Gin Leu Ala Leu 
530 535 540 

He Ala Gly Thr Ala Val Val Gly Val Val Leu Val Leu Val Val 
S45 550 555 

He Val Val Ala Val Leu Cys Leu Arg Lys Gin Ser Asn Gly Arg 
10 560 565 570 

Glu Ala Glu Tyr Ser Asp Lys His Gly Gin Tyr Leu He Gly His 
575 580 585 

Gly Thr Lys Val Tyr He Asp Pro Phe Thr Tyr Glu Asp Pro Asn 
590 595 600 

15 Glu Ala Val Arg Glu Phe Ala Lys Glu He Asp Val Ser Tyr Val 

605 610 615 

Lys He Glu Glu Val He Gly Ala Gly Glu Phe Gly Glu Val Cys 
620 625 630 

Arg Gly Arg Leu Lys Ala Pro Gly Lys Lys Glu Ser Cys Val Ala 
20 635 640 645 

He Lys Thr Leu Lys Gly Gly Tyr Thr Glu Arg Gin Arg Arg Glu 
650 655 660 

Phe Leu Ser Glu Ala Ser lie Met Gly Gin Phe Glu His Pro Asn 
665 670 675 

25 He He Arg Leu Glu Gly Val Val Thr Asn Ser Met Pro Val Met 

680 685 690 

He Leu Thr Glu Phe Met Glu Asn Gly Ala Leu Asp Ser Phe Leu 

695 700 705 . 

Arg Leu Asn Asp Gly Gin Phe Thr Val He Gin Leu Val Gly Met 
30 710 715 720 

Leu Arg Gly He Ala Ser Gly Met Arg Tyr Leu Ala Glu Met Ser 
725 730 735 

Tyr Val His Arg Asp Leu Ala Ala Arg Asn He Leu Val Asn Ser 
740 745 750 

35 Asn Leu Val Cys Lys Val Ser Asp Phe Gly Leu Ser Arg Phe Leu 

755 760 765 

Glu Glu Asn Ser Ser Asp Pro Thr Tyr Thr Ser Ser Leu Gly Gly 
770 775 780 
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Lys He Pro He Arg Trp Thr Ala Pro Glu Ala He Ala Phe Arg 
785 790 795 

Lys Phe Thr Ser Ala Ser Asp Ala Trp Ser Tyr Gly He Val Met 
800 805 810 

5 Trp Glu Val Met Ser Phe Gly Glu Arg Pro Tyr Trp Asp Met Ser 
815 820 825 

Asn Gin Asp Val He Asn Ala He Glu Gin Asp Tyr Arg Leu Pro 
830 835 840 

Pro Pro Pro Asp Cys Pro Thr Ser Leu His Gin Leu Met Leu Asp 
10 845 850 855 

Cys Trp Gin Lys Asp Arg Asn Ala Arg Pro Arg Phe Pro Gin Val 
860 865 870 

Val Ser Ala Leu Asp Lys Met He Arg Asn Pro Ala Ser Leu Lys 
875 680 885 

15 He Val Ala Arg Glu Asn Gly Gly Ala Ser His Pro Leu Leu Asp 

890 895 900 

Gin Arg Gin Pro His Tyr Ser Ala Phe Gly Ser Val Gly Glu Trp 
90S 910 915 

Leu Arg Ala He Lys Met Gly Arg Tyr Glu Glu Ser Phe Ala Ala 
20 920 925 930 

Ala Gly Phe Gly Ser Phe Glu Leu Val Ser Gin He Ser Ala Glu 
935 940 945 

Asp Leu Leu Arg He Gly Val Thr Leu Ala Gly His Gin Lys Lys 
950 955 960 

25 He Leu Ala Ser Val Gin His Met Lys Ser Gin Ala Lys Pro Gly 

965 970 975 

Thr Pro Gly Gly Thr Gly Gly Pro Ala Pro Gin Tyr Pro Ala Gly 
980 985 990 

Thr Pro His Pro Arg Asp Thr Ala Ser Pro Phe Ser Gly Ala Glu 
30 995 1000 1005 

Trp Gly Leu Thr Glu Ala Pro Ser Pro Val Pro Arg Trp He Ala 
1010 1015 1020 

Leu Ala Arg Gly Val Arg Ser Trp Gin Phe Gly Glu Thr Gly Phe 
1025 1030 1035 

35 Gly Gly Ser Ala He He Gly Gly Glu Asn His Pro Pro Ala Thr 
1040 1045 1050 

Ser Gly Asn Ser Arg Pro Arg Val Arg Ala Pro Phe Pro Gin Asp 
1055 1060 1065 
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Trp Val Pro Glu Glu Lys Glu Val Pro Asn lie Ser Gin Pro Pro 
1070 1075 1080 

Gin Val Pro Pro Ser Pro Trp Val Arg Ser Arg Arg Pro Lys Arg 
1085 1090 1095 

5 Val Leu Pro Cys Gin Leu Gin Ser Gly Gly Ala Val Pro Gly Gly 
1100 1105 1110 

Lys Lys Gly Cys Gin Gly Pro Val Thr Lys Ser Leu Gly Phe Val 
1115 1120 1125 

Val Pro Thr Cys Cys Cys His His Gin Thr Gin Ser Phe Phe Ser 
10 1130 1135 1140 

Leu Val Asn Ala Pro Pro Pro Ala Ala Ala Phe He Leu Lys Val 
1145 1150 1155 

Phe Glu Phe Cys Phe Trp Ser Phe Phe Ser Pro Phe Pro Phe Cys 
1160 1165 1170 

15 Phe Phe Val Leu Phe Phe Tyr Arg Pro Cys His Asn Phe Val Leu 
1175 1180 1185 

Glu Gly Thr Cys Phe Thr Met Ala Ser Phe Ala Gin Val Glu Thr 
1190 1195 1200 

Gly Ala His His His Val Cys Phe Gin Asn Ser Ala Leu Val He 
20 1205 1210 1215 

Pro His Pro Arg Thr Pro Pro Gly Thr Pro Lys Leu Cys Pro Met 
1220 1225 1230 

Lys Gly Cys Gly Val Arg Lys Gly Arg Leu Val Val Glu Pro Arg 
1235 1240 1245 

25 Asn Gly Arg Arg Cys Leu Glu Gly Phe Leu Asn Tyr He Lys Ser 
1250 1255 1260 

Asn Phe Leu Tyr Lys Lys Lys Met Gly Arg Val Pro Ala Pro Gly 
1265 1270 1275 

Val 
30 1276 

(2) INFORMATION FOR SEQ ID NO:25: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 59 amino acids 
(6) TYPE : amino acid 
35 <D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:25: 

Ala Arg Asn He Leu Val Asn Ser Asn Leu Val Cys Lys Val Ser 
15 10 15 
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Asp Phe Gly Leu Ser Arg Phe Leu Glu Asp Asp Thr Ser Asp Pro 
20 25 30 

■ Thr Tyr Thr Ser Ala Leu Gly Gly Lys lie Pro Met Arg Trp Thr 
35 40 45 

5 Ala Pro Glu Ala lie Gin Tyr Arg Lys Phe Ala Ser Ala Ser 

50 55 59 

(2) INFORMATION FOR SEQ ID NO: 26: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 54 amino acids 
10 (B) TYPE: amino acid 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26: 

Asn Val Leu Val Lys Ser Pro Asn His Val Lys lie Thr Asp Phe 
1 5 10 15 

15 Gly Leu Ala Arg Leu Leu Glu Gly Asp Glu Lys Glu Tyr Asn Ala 

20 25 30 

Asp Gly Gly Lys Met Pro He Lys Trp Met Ala Leu Glu Cys He 
35 40 45 

His Tyr Arg Lys Phe Thr His Gin Ser 
20 50 54 

(2) INFORMATION FOR SEQ ID NO:27: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 54 amino acids 

(B) TYPE: amino acid 
25 (D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27: 

Asn Cys Met Leu Ala Gly Asp Met Thr Val Cys Val Ala Asp Phe 
15 10 15 

Gly Leu Ser Trp Lys He Tyr Ser Gly Ala Thr He Val Arg Gly 
30 20 25 ' 30 

Cys Ala Ser Lys Leu Pro Val Lys Trp Leu Ala Leu Gly Ser Leu 
35 40 45 

Ala Asp Asn Leu Tyr Thr Val His Ser 
50 54 

35 (2) INFORMATION FOR SEQ ID NO:28: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 
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<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28: 

Asn Cys Leu Val Gly Lys Asn Tyr Thr lie Lys He Ala Asp Phe 
1 5 10 15 

Gly Met Ser Arg Asn Leu Tyr Ser Gly Asp Tyr Tyr 
5 20 25 27 

(2) INFORMATION FOR SEQ ID NO: 29: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 58 amino acids 

(B) TYPE: amino acid 
10 (D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29: 

Thr Arg Asn He Leu Val Glu Asn Glu Asn Arg Val Lys He Gly 
1 5 10 15 

Asp Phe Gly Leu Thr Lys Val Leu Pro Gin Asp Lys Glu Tyr Tyr 
15 20 25 30 

Lys Val Lys Glu Pro Gly Glu Ser Pro He Phe Trp Tyr Ala Pro 
35 40 45 

Glu Ser Leu Thr Glu Ser Leu Phe Ser Val Ala Ser Asp 
50 55 58 

20 (2) INFORMATION FOR SEQ ID NO: 30: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 58 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

25 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30: 

Ala Arg Asn He Leu Val Asn Ser Asn Leu Val Cys Lys Val Ser 
15 10 15 

Asp Phe Gly Met Ser Arg Val Leu Glu Asp Asp Pro Glu Ala Ala 
20 25 30 

30 Tyr Thr Thr Arg Gly Gly Lys He Pro He Arg Trp Thr Ala Pro 

35 40 45 

Glu Ala lie Ala Tyr Arg Lys Phe Thr Ser Ala Ser Asp 
50 55 . 58 

(2) INFORMATION FOR SEQ ID NO:31: 

35 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4425 bases 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO:31: 

TCGGGTCGGA CCCACGCGCA GCGGCCGGAG ATGCAGCGGG GCGCCGCGCT 50 

GTGCCTGCGA CTGTGGCTCT GCCTGGGACT CCTGGACGGC CTGGTGAGTG 100 

GCTACTCCAT GACCCCCCCG ACCTTGAACA TCACGGAGGA GTCACACGTC 150 

5 ATCGACACCG GTGACAGCCT GTCCATCTCC TGCAGGGGAC AGCACCCCCT 200 

CGAGTGGGCT TGGCCAGGAG CTCAGGAGGC GCCAGCCACC GGAGACAAGG 250 

ACAGCGAGGA CACGGGGGTG GTGCGAGACT GCGAGGGCAC AGACGCCAGG 300 

CCCTACTGCA AGGTGTTGCT GCTGCACGAG GTACATGCCA ACGACACAGG 350 

CAGCTACGTC TGCTACTACA AGTACATCAA GGCACGCATC GAGGGCACCA 400 

10 CGGCCGCCAG CTCCTACGTG TTCGTGAGAG ACTTTGAGCA GCCATTCATC 450 
AACAAGCCTG ACACGCTCTT GGTCAACAGG AAGGACGCCA TGTGGGTGCC 500 
CTGTCTGGTG TCCATCCCCG GCCTCAATGT CACGCTGCGC TCGCAAAGCT 550 
CGGTGCTGTG GCCAGACGGG CAGGAGGTGG TGTGGGATGA CCGGCGGGGC 600 
ATGCTCGTGT CCACGCCACT GCTGCACGAT GCCCTGTACC TGCAGTGCGA 650 

15 GACCACCTGG GGAGACCAGG ACTTCCTTTC CAACCCCTTC CTGGTGCACA 700 
TCACAGGCAA CGAGCTCTAT GACATCCAGC TGTTGCCCAG GAAGTCGCTG 750 
GAGCTGCTGG TAGGGGAGAA GCTGGTCCTG AACTGCACCG TGTGGGCTGA BOO 
GTTTAACTCA GGTGTCACCT TTGACTGGGA CTACCCAGGG AAGCAGGCAG 850 
AGCGGGGTAA GTGGGTGCCC GAGCGACGCT CCCAGCAGAC CCACACAGAA 900 
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CTCTCCAGCA TCCTGACCAT CCACAACGTC AGCCAGCACG ACCTGGGCTC 950 
GTATGTGTGC AAGGCCAACA ACGGCATCCA GCGATTTCGG GAGAGCACCG 1000 
AGGTCATTGT GCATGAAAAT CCCTTCATCA GCGTCGAGTG GCTCAAAGGA 1050 
CCCATCCTGG AGGCCACGGC AGGAGACGAG CTGGTGAAGC TGCCCGTGAA 1100 
5 GCTGGCAGCG TACCCCCCGC CCGAGTTCCA GTGGTACAAG GATGGAAAGG 1150 
CACTGTCCGG GCGCCACAGT CCACATGCCC TGGTGCTCAA GGAGGTGACA 1200 
GAGGCCAGCA CAGGCACCTA CACCCTCGCC CTGTGGAACT CCGCTGCTGG 1250 
CCTGAGGCGC AACATCAGCC TGGAGCTGGT GGTGAATGTG CCCCCCCAGA 1300 
TACATGAGAA GGAGGCCTCC TCCCCCAGCA TCTACTCGCG TCACAGCCGC 1350 

10 CAGGCCCTCA CCTGCACGGC CTACGGGGTG CCCCTGCCTC TCAGCATCCA 1400 
GTGGCACTGG CGGCCCTGGA CACCCTGCAA GATGTTTGCC CAGCGTAGTC 1450 
TCCGGCGGCG GCAGCAGCAA GACCTCATGC CACAGTGCCG TGACTGGAGG 1500 
GCGGTGACCA CGCAGGATGC CGTGAACCCC ATCGAGAGCC TGGACACCTG 1550 
GACCGAGTTT GTGGAGGGAA AGAATAAGAC TGTGAGCAAG CTGGTGATCC 1600 

15 AGAATGCCAA CGTGTCTGCC ATGTACAAGT GTGTGGTCTC CAACAAGGTG 1650 
GGCCAGGATG AGCGGCTCAT CTACTTCTAT GTGACCACCA TCCCCGACGG 1700 
CTTCACCATC GAATCCAAGC CATCCGAGGA GCTACTAGAG GGCCAGCCGG 1750 
TGCTCCTGAG CTGCCAAGCC GACAGCTACA AGTACGAGCA TCTGCGCTGG 1800 
TACCGCCTCA ACCTGTCCAC GCTGCACGAT GCGCACGGGA ACCCGCTTCT 1850 
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GCTCGACTGC AAGAACGTGC ATCTGTTCGC CACCCCTCTG GCCGCCAGCC 1900 

TGGAGGAGGT GGCACCTGGG GCGCGCCACG CCACGCTCAG CCTGAGTATC 1950 

CCCCGCGTCG CGCCCGAGCA CGAGGGCCAC TATGTGTGCG AAGTGCAAGA 2000 

CCGGCGCAGC CATGACAAGC ACTGCCACAA GAAGTACCTG TCGGTGCAGG 2050 

5 CCCTGGAAGC CCCTCGGCTC ACGCAGAACT TGACCGACCT CCTGGTGAAC 2100 

GTGAGCGACT CGCTGGAGAT GCAGTGCTTG GTGGCCGGAG CGCACGCGCC 2150 

CAGCATCGTG TGGTACAAAG ACGAGAGGCT GCTGGAGGAA AAGTCTGGAG 2200 

TCGACTTGGC GGACTCCAAC CAGAAGCTGA GCATCCAGCG CGTGCGCGAG 2250 

GAGGATGCGG GACGCTATCT GTGCAGCGTG TGCAACGCCA AGGGCTGCGT 2300 

10 CAACTCCTCC GCCAGCGTGG CCGTGGAAGG CTCCGAGGAT AAGGGCAGCA 2350 
TGGAGATCGT GATCCTTGTC GGTACCGGCG TCATCGCTGT CTTCTTCTGG 2400 
GTCCTCCTCC TCCTCATCTT CTGTAACATG AGGAGGCCGG CCCACGCAGA 2450 
CATCAAGACG GGCTACCTGT CCATCATCAT GGACCCCGGG GAGGTGCCTC 2500 
TGGAGGAGCA ATGCGAATAC CTGTCCTACG ATGCCAGCCA GTGGGAATTC 2550 

15 CCCCGAGAGC GGCTGCACCT GGGGAGAGTG CTCGGCTACG GCGCCTTCGG 2600 
GAAGGTGGTG GAAGCCTCCG CTTTCGGCAT CCACAAGGGC AGCAGCTGTG 2650 
ACACCGTGGC CGTGAAAATG CTGAAAGAGG GCGCCACGGC CAGCGAGCAC 2700 
CGCGCGCTGA TGTCGGAGCT CAAGATCCTC ATTCACATCG GCAACCACCT 2750 
CAACGTGGTC AACCTCCTCG GGGCGTGCAC CAAGCCGCAG GGCCCCCTCA 2800 
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TGGTGATCGT GGAGTTCTGC AAGTACGGCA ACCTCTCCAA CTTCCTGCGC 2850 

GCCAAGCGGG ACGCCTTCAG CCCCTGCGCG GAGAAGTCTC CCGAGCAGCG 2900 

CGGACGCTTC CGCGCCATGG TGGAGCTCGC CAGGCTGGAT CGGAGGCGGC 2950 

CGGGGAGCAG CGACAGGGTC CTCTTCGCGC GGTTCTCGAA GACCGAGGGC 3000 

5 GGAGCGAGGC GGGCTTCTCC AGACCAAGAA GCTGAGGACC TGTGGCTGAG 3050 

CCCGCTGACC ATGGAAGATC TTGTCTGCTA CAGCTTCCAG GTGGCCAGAG 3100 

GGATGGAGTT CCTGGCTTCC CGAAAGTGCA TCCACAGAGA CCTGGCTGCT 3150 

CGGAACATTC TGCTGTCGGA AAGCGACGTG GTGAAGATCT GTGACTTTGG 3200 

CCTTGCCCGG GACATCTACA AAGACCCTGA CTACGTCCGC AAGGGCAGTG 3250 

10 CCCGGCTGCC CCTGAAGTGG ATGGCCCCTG AAAGCATCTT CGACAAGGTG 3300 
TACACCACGC AGAGTGACGT GTGGTCCTTT GGGGTGCTTC TCTGGGAGAT 3350 
CTTCTCTCTG GGGGCCTCCC CGTACCCTGG GGTGCAGATC AATGAGGAGT 3400 
TCTGCCAGCG GCTGAGAGAC GGCACAAGGA TGAGGGCCCC GGAGCTGGCC 3450 
ACTCCCGCCA TACGCCGCAT CATGCTGAAC TGCTGGTCCG GAGACCCCAA 3500 

15 GGCGAGACCT GCATTCTCGG AGCTGGTGGA GATCCTGGGG GACCTGCTCC 3550 
AGGGCAGGGG CCTGCAAGAG GAAGAGGAGG TCTGCATGGC CCCGCGCAGC 3600 
TCTCAGAGCT CAGAAGAGGG CAGCTTCTCG CAGGTGTCCA CCATGGCCCT 3650 
ACACATCGCC CAGGCTGACG CTGAGGACAG CCCGCCAAGC CTGCAGCGCC 3700 
ACAGCCTGGC CGCCAGGTAT TACAACTGGG TGTCCTTTCC CGGGTGCCTG 3750 
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GCCAGAGGGG CTGAGACCCG TGGTTCCTCC AGGATGAAGA CATTTGAGGA 3800 

ATTCCCCATG ACCCCAACGA CCTACAAAGG CTCTGTGGAC AACCAGACAG 3850 

ACAGTGGGAT GGTGCTGGCC TCGGAGGAGT TTGAGCAGAT AGAGAGCAGG 3900 

CATAGACAAG AAAGCGGCTT CAGGTAGCTG AAGCAGAGAG AGAGAAGGCA 3950 



5 GCATACGTCA GCATTTTCTT CTCTGCACTT ATAAGAAAGA TCAAAGACTT 4000 
TAAGACTTTC GCTATTTCTT CTGCTATCTA CTACAAACTT CAAAGAGGAA 4050 
CCAGGAGGCC AAGAGGAGCA TGAAAGTGGA CAAGGAGTGT GACCACTGAA 4100 
GCACCACAGG GAGGGGTTAG GCCTCCGGAT GACTGCGGGC AGGCCTGGAT 4150 
AATATCCAGC CTCCCACAAG AAGCTGGTGG AGCAGAGTGT TCCCTGACTC 4200 
10 CTCCAAGGAA AGGGAGACGC CCTTTCATGG TCTGCTGAGT AACAGGTGCC 4250 
TTCCCAGACA CTGGCGTTAC TGCTTGACCA AAGAGCCCTC AAGCGGCCCT 4300 
TATGCCAGCG TGACAGAGGG CTCACCTCTT GCCTTCTAGG TCACTTCTCA 4350 
CAATGTCCCT TCAGCACCTG ACCCTGTGCC CGCCAGTTAT TCCTTGGTAA 4400 
TATGAGTAAT ACATCAAAGA GTAGT 4425 

15 (2) INFORMATION FOR SEQ ID NO: 32: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4425 bases 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
20 (D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32: 
AGCCCAGCCT GGGTGCGCGT CGCCGGCCTC TACGTCGCCC CGCGGCGCGA 50 
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CACGGACGCT GACACCGAGA CGGACCCTGA GGACCTGCCG GACCACTCAC 100 
CGATGAGGTA CTGGGGGGGC TGGAACTTGT AGTGCCTCCT CAGTGTGCAG 150 
TAGCTGTGGC CACTGTCGGA CAGGTAGAGG ACGTCCCCTG TCGTGGGGGA 200 
GCTCACCCGA ACCGGTCCTC GAGTCCTCCG CGGTCGGTGG CCTCTGTTCC 250 
5 TGTCGCTCCT GTGCCCCCAC CACGCTCTGA CGCTCCCGTG TCTGCGGTCC 300 
GGGATGACGT TCCACAACGA CGACGTGCTC CATGTACGGT TGCTGTGTCC 350 
GTCGATGCAG ACGATGATGT TCATGTAGTT CCGTGCGTAG CTCCCGTGGT 400 
GCCGGCGGTC GAGGATGCAC AAGCACTCTC TGAAACTCGT CGGTAAGTAG 450 
TTGTTCGGAC TGTGCGAGAA CCAGTTGTCC TTCCTGCGGT ACACCCACGG 500 

10 GACAGACCAC AGGTAGGGGC CGGAGTTACA GTGCGACGCG AGCGTTTCGA 550 
GCCACGACAC CGGTCTGCCC GTCCTCCACC ACACCCTACT GGCCGCCCCG 600 
TACGAGCACA GGTGCGGTGA CGACGTGCTA CGGGACATGG ACGTCACGCT 650 
CTGGTGGACC CCTCTGGTCC TGAAGGAAAG GTTGGGGAAG GACCACGTGT 700 
AGTGTCCGTT GCTCGAGATA CTGTAGGTCG ACAACGGGTC CTTCAGCGAC 750 

15 CTCGACGACC ATCCCCTCTT CGACCAGGAC TTGACGTGGC ACACCCGACT 800 
CAAATTGAGT CCACAGTGGA AACTGACCCT GATGGGTCCC TTCGTCCGTC 850 
TCGCCCCATT CACCCACGGG CTCGCTGCGA GGGTCGTCTG GGTGTGTCTT 900 
GAGAGGTCGT AGGACTGGTA GGTGTTGCAG TCGGTCGTGC TGGACCCGAG 950 
CATACACACG TTCCGGTTGT TGCCGTAGGT CGCTAAAGCC CTCTCGTGGC 1000 
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TCCAGTAACA CGTACTTTTA 6GGAAGTAGT CGCAGCTCAC CGAGTTTCCT 1050 

GGGTAGGACC TCCGGTGCCG TCCTCTGCTC GACCACTTCG ACGGGCACTT 1100 

CGACCGTCGC ATGGGGGGCG GGCTCAAGGT CACCATGTTC CTACCTTTCC 1150 

GTGACAGGCC CGCGGTGTCA GGTGTACGGG ACCACGAGTT CCTCCACTGT 1200 

5 CTCCGGTCGT GTCCGTGGAT GTGGGAGCGG GACACCTTGA GGCGACGACC 1250 

GGACTCCGCG TTGTAGTCGG ACCTCGACCA CCACTTACAC GGGGGGGTCT 1300 

ATGTACTCTT CCTCCGGAGG AGGGGGTCGT AGATGAGCGC AGTGTCGGCG 1350 

GTCCGGGAGT GGACGTGCCG GATGCCCCAC GGGGACGGAG AGTCGTAGGT 1400 

CACCGTGACC GCCGGGACCT GTGGGACGTT CTACAAACGG GTCGCATCAG 1450 

10 AGGCCGCCGC CGTCGTCGTT CTGGAGTACG GTGTCACGGC ACTGACCTCC 1500 
CGCCACTGGT GCGTCCTACG GCACTTGGGG TAGCTCTCGG ACCTGTGGAC 1550 
CTGGCTCAAA CACCTCCCTT TCTTATTCTG ACACTCGTTC GACCACTAGG 1600 
TCTTACGGTT GCACAGACGG TACATGTTCA CACACCAGAG GTTGTTCCAC 1650 
CCGGTCCTAC TCGCCGAGTA GATGAAGATA CACTGGTGGT AGGGGCTGCC 1700 

15 GAAGTGGTAG CTTAGGTTCG GTAGGCTCCT CGATGATCTC CCGGTCGGCC 1750 
ACGAGGACTC GACGGTTCGG CTGTCGATGT TCATGCTCGT AGACGCGACC 1800 
ATGGCGGAGT TGGACAGGTG CGACGTGCTA CGCGTGCCCT TGGGCGAAGA 1850 
CGAGCTGACG TTCTTGCACG TAGACAAGCG GTGGGGAGAC CGGCGGTCGG 1900 
ACCTCCTCCA CCGTGGACCC CGCGCGGTGC GGTGCGAGTC GGACTCATAG 1950 
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G666CGCAGC GCGGGCTCGT GCTCCCGGTG ATACACACGC TTCACGTTCT 2000 
GGCCGCGTCG GTACTGTTCG TGACGGTGTT CTTCATGGAC AGCCACGTCC 2050 
GGGACCTTCG GGGAGCCGAG TGCGTCTTGA ACTGGCTGGA GGACCACTTG 2100 
CACTCGCTGA GCGACCTCTA CGTCACGAAC CACCGGCCTC GCGTGCGCGG 2150 
5 GTCGTAGCAC ACCATGTTTC TGCTCTCCGA CGACCTCCTT TTCAGACCTC 2200 
AGCTGAACCG CCTGAGGTTG GTCTTCGACT CGTAGGTCGC GCACGCGCTC 2250 
CTCCTACGCC CTGCGATAGA CACGTCGCAC ACGTTGCGGT TCCCGACGCA 2300 
GTTGAGGAGG CGGTCGCACC GGCACCTTCC GAGGCTCCTA TTCCCGTCGT 2350 
ACCTCTAGCA CTAGGAACAG CCATGGCCGC AGTAGCGACA GAAGAAGACC 2400 

10 CAGGAGGAGG AGGAGTAGAA GACATTGTAC TCCTCCGGCC GGGTGCGTCT 2450 
GTAGTTCTGC CCGATGGACA GGTAGTAGTA CCTGGGGCCC CTCCACGGAG 2500 
ACCTCCTCGT TACGCTTATG GACAGGATGC TACGGTCGGT CACCCTTAAG 2550 
GGGGCTCTCG CCGACGTGGA CCCCTCTCAC GAGCCGATGC CGCGGAAGCC 2600 
CTTCCACCAC CTTCGGAGGC GAAAGCCGTA GGTGTTCCCG TCGTCGACAC 2650 

15 TGTGGCACCG GCACTTTTAC GACTTTCTCC CGCGGTGCCG GTCGCTCGTG 2700 
GCGCGCGACT ACAGCCTCGA GTTCTAGGAG TAAGTGTAGC CGTTGGTGGA 2750 
GTTGCACCAG TTGGAGGAGC CCCGCACGTG GTTCGGCGTC CCGGGGGAGT 2800 
ACCACTAGCA CCTCAAGACG TTCATGCCGT TGGAGAGGTT GAAGGACGCG 2850 
CGGTTCGCCC TGCGGAAGTC GGGGACGCGC CTCTTCAGAG GGCTCGTCGC 2900 
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GCCTGCGAAG GCGCGGTACC ACCTCGAGCG GTCCGACCTA GCCTCCGCCG 2950 

GCCCCTCGTC GCTGTCCCAG GAGAAGCGCG CCAAGAGCTT CTGGCTCCCG 3000 

CCTCGCTCCG CCCGAAGAGG TCTGGTTCTT CGACTCCTGG ACACCGACTC 3050 

GGGCGACTGG TACCTTCTAG AACAGACGAT GTCGAAGGTC CACCGGTCTC 3100 

5 CCTACCTCAA GGACCGAAGG GCTTTCACGT AGGTGTCTCT GGACCGACGA 3150 

GCCTTGTAAG ACGACAGCCT TTCGCTGCAC CACTTCTAGA CACTGAAACC 3200 

GGAACGGGCC CTGTAGATGT TTCTGGGACT GATGCAGGCG TTCCCGTCAC 3250 

GGGCCGACGG GGACTTCACC TACCGGGGAC TTTCGTAGAA GCTGTTCCAC 3300 

ATGTGGTGCG TCTCACTGCA CACCAGGAAA CCCCACGAAG AGACCCTCTA 3350 

10 GAAGAGAGAC CCCCGGAGGG GCATGGGACC CCACGTCTAG TTACTCCTCA 3400 

AGACGGTCGC CGACTCTCTG CCGTGTTCCT ACTCCCGGGG CCTCGACCGG 3450 

TGAGGGCGGT ATGCGGCGTA GTACGACTTG ACGACCAGGC CTCTGGGGTT 3500 

CCGCTCTGGA CGTAAGAGCC TCGACCACCT CTAGGACCCC CTGGACGAGG 3550 

TCCCGTCCCC GGACGTTCTC CTTCTCCTCC AGACGTACCG GGGCGCGTCG 3600 

15 AGAGTCTCGA GTCTTCTCCC GTCGAAGAGC GTCCACAGGT GGTACCGGGA 3650 

TGTGTAGCGG GTCCGACTGC GACTCCTGTC GGGCGGTTCG GACGTCGCGG 3700 

TGTCGGACCG GCGGTCCATA ATGTTGACCC ACAGGAAAGG GCCCACGGAC 3750 

CGGTCTCCCC GACTCTGGGC ACCAAGGAGG TCCTACTTCT GTAAACTCCT 3800 

TAAGGGGTAC TGGGGTTGCT GGATGTTTCC GAGACACCTG TTGGTCTGTC 3850 
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TGTCACCCTA CCACGACCGG AGCCTCCTCA AACTCGTCTA TCTCTCGTCC 3900 



GTATCTGTTC TTTCGCCGAA GTCCATCGAC TTCGTCTCTC TCTCTTCCGT 3950 
CGTATGCAGT CGTAAAAGAA GAGACGTGAA TATTCTTTCT AGTTTCTGAA 4000 
ATTCTGAAAG CGATAAAGAA GACGATAGAT GATGTTTGAA GTTTCTCCTT 4050 
5 GGTCCTCCGG TTCTCCTCGT ACTTTCACCT GTTCCTCACA CTGGTGACTT 4100 
CGTGGTGTCC CTCCCCAATC CGGAGGCCTA CTGACGCCCG TCCGGACCTA 4150 
TTATAGGTCG GAGGGTGTTC TTCGACCACC TCGTCTCACA AGGGACTGAG 4200 
GAGGTTCCTT TCCCTCTGCG GGAAAGTACC AGACGACTCA TTGTCCACGG 4250 
AAGGGTCTGT GACCGCAATG ACGAACTGGT TTCTCGGGAG TTCGCCGGGA 4300 
10 ATACGGTCGC ACTGTCTCCC GAGTGGAGAA CGGAAGATCC AGTGAAGAGT 4350 
GTTACAGGGA AGTCGTGGAC TGGGACACGG GCGGTCAATA AGGAACCATT 4400 
ATACTCATTA TGTAGTTTCT CATCA 4425 

(2) INFORMATION FOR SEQ ID NO:33: 

(i) SEQUENCE CHARACTERISTICS : 
15 (A) LENGTH: 1298 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:33: 

Met Gin Arg Gly Ala Ala Leu Cys Leu Arg Leu Trp Leu Cys Leu 
20 1 5 10 15 

Gly Leu Leu Asp Gly Leu Val Ser Gly Tyr Ser Met Thr Pro Pro 
20 25 30 

Thr Leu Asn He Thr Glu Glu Ser His Val He Asp Thr Gly Asp 
35 40 45 

25 Ser Leu Ser He Ser Cys Arg Gly Gin His Pro Leu Glu Trp Ala 

50 55 60 
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Trp Pro Gly Ala Gin Glu Ala Pro Ala Thr Gly Asp Lys Asp Ser 
65 70 75 

Glu Asp Thr Gly Val Val Arg Asp Cys Glu Gly Thr Asp Ala Arg 
80 85 90 

5 Pro Tyr Cys Lys Val Leu Leu Leu His Glu Val His Ala Asn Asp 

95 100 105 

Thr Gly Ser Tyr Val Cys Tyr Tyr Lys Tyr lie Lys Ala Arg lie 
110 115 120 

Glu Gly Thr Thr Ala Ala Ser Ser Tyr Val Phe Val Arg Asp Phe 
10 125 130 135 

Glu Gin Pro Phe lie Asn Lys Pro Asp Thr Leu Leu Val Asn Arg 
140 145 150 

Lys Asp Ala Met Trp Val Pro Cys Leu Val Ser lie Pro Gly Leu 
155 160 165 

15 Asn Val Thr Leu Arg Ser Gin Ser Ser Val Leu Trp Pro Asp Gly 

170 175 180 

Gin Glu Val Val Trp Asp Asp Arg Arg Gly Met Leu Val Ser Thr 
185 190 195 

Pro Leu Leu His Asp Ala Leu Tyr Leu Gin Cys Glu Thr Thr Trp 
20 200 205 210 

Gly Asp Gin Asp Phe Leu Ser Asn Pro Phe Leu Val His lie Thr 
215 220 225 

Gly Asn Glu Leu Tyr Asp lie Gin Leu Leu Pro Arg Lys Ser Leu 
230 235 240 

25 Glu Leu Leu Val Gly Glu Lys Leu Val Leu Asn Cys Thr Val Trp 

245 250 255 

Ala Glu Phe Asn Ser Gly Val Thr Phe Asp Trp Asp Tyr Pro Gly 
260 265 270 

Lys Gin Ala Glu Arg Gly Lys Trp Val Pro Glu Arg Arg Ser Gin 
30 275 280 285 

Gin Thr His Thr Glu Leu Ser Ser lie Leu Thr lie His Asn Val 
290 295 300 

Ser Gin His Asp Leu Gly Ser Tyr Val Cys Lys Ala Asn Asn Gly 
305 310 315 

35 lie Gin Arg Phe Arg Glu Ser Thr Glu Val He Val His Glu Asn 

320 325 330 

Pro Phe He Ser Val Glu Trp Leu Lys Gly Pro He Leu Glu Ala 
335 340 345 
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Thr Ala Gly Asp Glu Leu Val Lys Leu Pro Val Lys Leu Ala Ala 
350 355 360 

Tyr Pro Pro Pro Glu Phe Gin Trp Tyr Lys Asp Gly Lys Ala Leu 
365 370 375 

5 Ser Gly Arg His Ser Pro His Ala Leu Val Leu Lys Glu Val Thr 
380 385 390 

Glu Ala Ser Thr Gly Thr Tyr Thr Leu Ala Leu Trp Asn Ser Ala 
395 400 405 

Ala Gly Leu Arg Arg Asn He Ser Leu Glu Leu Val Val Asn Val 
10 410 415 420 

Pro Pro Gin He His Glu Lys Glu Ala Ser Ser Pro Ser He Tyr 
425 430 435 

Ser Arg His Ser Arg Gin Ala Leu Thr Cys Thr Ala Tyr Gly Val 
440 445 450 

15 Pro Leu Pro Leu Ser He Gin Trp His Trp Arg Pro Trp Thr Pro 

455 460 465 

Cys Lys Met Phe Ala Gin Arg Ser Leu Arg Arg Arg Gin Gin Gin 
470 475 480 

Asp Leu Met Pro Gin Cys Arg Asp Trp Arg Ala Val Thr Thr Gin 
20 485 490 495 

Asp Ala Val Asn Pro He Glu Ser Leu Asp Thr Trp Thr Glu Phe 
500 505 510 

Val Glu Gly Lys Asn Lys Thr Val Ser Lys Leu Val He Gin Asn 
515 520 525 

25 Ala Asn Val Ser Ala Met Tyr Lys Cys Val Val Ser Asn Lys Val 

530 535 540 

Gly Gin Asp Glu Arg Leu He Tyr Phe Tyr Val Thr Thr He Pro 
545 550 555 

Asp Gly Phe Thr He Glu Ser Lys Pro Ser Glu Glu Leu Leu Glu 
30 560 565 570 

Gly Gin Pro Val Leu Leu Ser Cys Gin Ala Asp Ser Tyr Lys Tyr 
575 580 585 

Glu His Leu Arg Trp Tyr Arg Leu Asn Leu Ser Thr Leu His Asp 
590 595 600 

35 Ala His Gly Asn Pro Leu Leu Leu Asp Cys Lys Asn Val His Leu 

605 610 615 

Phe Ala Thr Pro Leu Ala Ala Ser Leu Glu Glu Val Ala Pro Gly 
620 625 630 
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Ala Arg His Ala Thr Leu Ser Leu Ser lie Pro Arg Val Ala Pro 
635 640 645 

Glu His Glu Gly His Tyr Val Cys Glu Val Gin Asp Arg Arg Ser 
650 655 660 

5 His Asp Lys His Cys His Lys Lys Tyr Leu Ser Val Gin Ala Leu 
665 670 675 

Glu Ala Pro Arg Leu Thr Gin Asn Leu Thr Asp Leu Leu Val Asn 
680 685 690 

Val Ser Asp Ser Leu Glu Met Gin Cys Leu Val Ala Gly Ala His 
10 695 700 705 

Ala Pro Ser lie Val Trp Tyr Lys Asp Glu Arg Leu Leu Glu Glu 
710 715 720 

Lys Ser Gly Val Asp Leu Ala Asp Ser Asn Gin Lys Leu Ser lie 
725 730 735 

15 Gin Arg Val Arg Glu Glu Asp Ala Gly Arg Tyr Leu Cys Ser Val 

740 745 750 

Cys Asn Ala Lys Gly Cys Val Asn Ser Ser Ala Ser Val Ala Val 
755 760 765 

Glu Gly Ser Glu Asp Lys Gly Ser Met Glu He Val He Leu Val 
20 770 775 780 

Gly Thr Gly Val He Ala Val Phe Phe Trp Val Leu Leu Leu Leu 
785 790 795 

He Phe Cys Asn Met Arg Arg Pro Ala His Ala Asp He Lys Thr 
800 805 810 

25 Gly Tyr Leu Ser He He Met Asp Pro Gly Glu Val Pro Leu Glu 

815 820 825 

Glu Gin Cys Glu Tyr Leu Ser Tyr Asp Ala Ser Gin Trp Glu Phe 
830 835 840 

Pro Arg Glu Arg Leu His Leu Gly Arg Val Leu Gly Tyr Gly Ala 
30 845 850 855 

Phe Gly Lys Val Val Glu Ala Ser Ala Phe Gly He His Lys Gly 
860 865 870 

Ser Ser Cys Asp Thr Val Ala Val Lys Met Leu Lys Glu Gly Ala 
875 880 885 

35 Thr Ala Ser Glu His Arg Ala Leu Met Ser Glu Leu Lys He Leu 

890 695 900 

He His He Gly Asn His Leu Asn Val Val Asn Leu Leu Gly Ala 
905 910 915 
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Cys Thr Lys Pro Gin Gly Pro Leu Met Val He Val Glu Phe Cys 
920 925 930 

Lys Tyr Gly Asn Leu Ser Asn Phe Leu Arg Ala Lys Arg Asp Ala 
935 940 945 

5 Phe Ser Pro Cys Ala Glu Lys Ser Pro Glu Gin Arg Gly Arg Phe 
950 955 960 

Arg Ala Met Val Glu Leu Ala Arg Leu Asp Arg Arg Arg Pro Gly 
965 970 975 

Ser Ser Asp Arg Val Leu Phe Ala Arg Phe Ser Lys Thr Glu Gly 
10 980 985 990 

Gly Ala Arg Arg Ala Ser Pro Asp Gin Glu Ala Glu Asp Leu Trp 
995 1000 1005 

Leu Ser Pro Leu Thr Met Glu Asp Leu Val Cys Tyr Ser Phe Gin 
1010 1015 1020 

15 Val Ala Arg Gly Met Glu Phe Leu Ala Ser Arg Lys Cys He His 
1025 1030 1035 

Arg Asp Leu Ala Ala Arg Asn He Leu Leu Ser Glu Ser Asp Val 
1040 1045 1050 

Val Lys He Cys Asp Phe Gly Leu Ala Arg Asp He Tyr Lys Asp 
20 1055 1060 1065 

Pro Asp Tyr Val Arg Lys Gly Ser Ala Arg Leu Pro Leu Lys Trp 
1070 1075 1080 

Met Ala Pro Glu Ser He Phe Asp Lys Val Tyr Thr Thr Gin Ser 
1085 1090 1095 

25 Asp Val Trp Ser Phe Gly Val Leu Leu Trp Glu He Phe Ser Leu 
1100 1105 1110 

Gly Ala Ser Pro Tyr Pro Gly Val Gin He Asn Glu Glu Phe Cys 
1115 1120 1125 

Gin Arg Leu Arg Asp Gly Thr Arg Met Arg Ala Pro Glu Leu Ala 
30 1130 1135 1140 

Thr Pro Ala He Arg Arg He Met Leu Asn Cys Trp Ser Gly Asp 
1145 1150 1155 

Pro Lys Ala Arg Pro Ala Phe Ser Glu Leu Val Glu He Leu Gly 
1160 1165 1170 

35 Asp Leu Leu Gin Gly Arg Gly Leu Gin Glu Glu Glu Glu Val Cys 
1175 1180 1185 

Met Ala Pro Arg Ser Ser Gin Ser Ser Glu Glu Gly Ser Phe Ser 
1190 1195 1200 



-87- 



WO 95/27061 



PCT/US95/04228 



Gin Val Ser Thr Met Ala Leu His He Ala Gin Ala Asp Ala Glu 
1205 1210 1215 

Asp Ser Pro Pro Ser Leu Gin Arg His Ser Leu Ala Ala Arg Tyr 
1220 1225 1230 

5 Tyr Asn Trp Val Ser Phe Pro Gly Cys Leu Ala Arg Gly Ala Glu 
1235 1240 1245 

Thr Arg Gly Ser Ser Arg Met Lys Thr Phe Glu Glu Phe Pro Met 
1250 1255 1260 

Thr Pro Thr Thr Tyr Lys Gly Ser Val Asp Asn Gin Thr Asp Ser 
10 1265 1270 1275 

Gly Met Val Leu Ala Ser Glu Glu Phe Glu Gin He Glu Ser Arg 
1280 1285 1290 

His Arg Gin Glu Ser Gly Phe Arg 
1295 1298 

15 (2) INFORMATION FOR SEQ ID NO: 34: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3348 bases 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
20 (D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 34: 



ATGGCTGGGA TTTTCTATTT CGCCCTATTT TCGTGTCTCT TCGGGATTTG 50 



CGACGCTGTC ACAGGTTCCA GGGTATACCC CGCGAATGAA GTTACCTTAT 100 



TGGATTCCAG ATCTGTTCAG GGAGAACTTG GGTGGATAGC AAGCCCTCTG 150 



25 GAAGGAGGGT GGGAGGAAGT GAGTATCATG GATGAAAAAA ATACACCAAT 200 



CCGAACCTAC CAAGTGTGCA ATGTGATGGA ACCCAGCCAG AATAACTGGC 250 



TACGAACTGA TTGGATCACC CGAGAAGGGG CTCAGAGGGT GTATATTGAG 300 



ATTAAATTCA CCTTGAGGGA CTGCAATAGT CTTCCGGGCG TCATGGGGAC 350 



TTGCAAGGAG ACGTTTAACC TGTACTACTA TGAATCAGAC AACGACAAAG 400 



30 AGCGTTTCAT CAGAGAGAAC CAGTTTGTCA AAATTGACAC CATTGCTGCT 450 
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GATGAGAGCT TCACCCAAGT GGACATTGGT GACAGAATCA TGAAGCTGAA 500 
CACCGAGATC CGGGATGTAG GGCCATTAAG CAAAAAGGGG TTTTACCTGG 550 
CTTTTCAGGA TGTGGGGGCC TGCATCGCCC TGGTATCAGT CCGTGTGTTC 600 
TATAAAAAGT GTCCACTCAC AGTCCGCAAT CTGGCCCAGT TTCCTGACAC 650 
5 CATCACAGGG GCTGATACGT CTTCCCTGGT GGAAGTTCGA GGCTCCTGTG 700 
TCAACAACTC AGAAGAGAAA GATGTGCCAA AAATGTACTG TGGGGCAGAT 750 
GGTGAATGGC TGGTACCCAT TGGCAACTGC CTATGCAACG CTGGGCATGA 800 
GGAGCGGAGC GGAGAATGCC AAGCTTGCAA AATTGGATAT TACAAGGCTC 850 
TCTCCACGGA TGCCACCTGT GCCAAGTGCC CACCCCACAG CTACTCTGTC 900 
10 TGGGAAGGAG CCACCTCGTG CACCTGTGAC CGAGGCTTTT TCAGAGCTGA 950 
CAACGATGCT GCCTCTATGC CCTGCACCCG TCCACCATCT GCTCCCCTGA 1000 
ACTTGATTTC AAATGTCAAC GAGACATCTG TGAACTTGGA ATGGAGTAGC 1050 
CCTCAGAATA CAGGTGGCCG CCAGGACATT TCCTATAATG TGGTATGCAA 1100 
GAAATGTGGA GCTGGTGACC CCAGCAAGTG CCGACCCTGT GGAAGTGGGG 1150 
15 TCCACTACAC CCCACAGCAG AATGGCTTGA AGACCACCAA AGGCTCCATC 1200 
ACTGACCTCC TAGCTCATAC CAATTACACC TTTGAAATCT GGGCTGTGAA 1250 
TGGAGTGTCC AAATATAACC CTAACCCAGA CCAATCAGTT TCTGTCACTG 1300 
TGACCACCAA CCAAGCAGCA CCATCATCCA TTGCTTTGGT CCAGGCTAAA 1350 
GAAGTCACAA GATACAGTGT GGCACTGGCT TGGCTGGAAC CAGATCGGCC 1400 
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CAATGGGGTA ATCCTGGAAT ATGAAGTCAA GTATTATGAG AAGGATCAGA 1450 

ATGAGCGAAG CTATCGTATA GTTCGGACAG CTGCCAGGAA CACAGATATC 1500 
AAAGGCCTGA ACCCTCTCAC TTCCTATGTT TTCCACGTGC GAGCCAGGAC 1550 
AGCAGCTGGC TATGGAGACT TCAGTGAGCC CTTGGAGGTT ACAACCAACA 1600 
5 CAGTGCCTTC CCGGATCATT GGAGATGGGG CTAACTCCAC AGTCCTTCTG 1650 
GTCTCTGTCT CGGGCAGTGT GGTGCTGGTG GTAATTCTCA TTGCAGCTTT 1700 
TGTCATCAGC CGGAGACGGA GTAAATACAG TAAAGCCAAA CAAGAAGCGG 1750 
ATGAAGAGAA ACATTTGAAT CAAGGTGTAA GAACATATGT GGACCCCTTT 1800 
ACGTACGAAG ATCCCAACCA AGCAGTGCGA GAGTTTGCCA AAGAAATTGA 1850 

10 CGCATCCTGC ATTAAGATTG AAAAAGTTAT AGGAGTTGGT GAATTTGGTG 1900 
AGGTATGCAG TGGGCGTCTC AAAGTGCCTG GCAAGAGAGA GATCTGTGTG 1950 
GCTATCAAGA CTCTGAAAGC TGGTTATACA GACAAACAGA GGAGAGACTT 2000 
CCTGAGTGAG GCCAGCATCA TGGGACAGTT TGACCATCCG AACATCATTC 2050 
ACTTGGAAGG CGTGGTCACT AAATGTAAAC CAGTAATGAT CATAACAGAG 2100 

15 TACATGGAGA ATGGCTCCTT GGATGCATTC CTCAGGAAAA ATGATGGCAG 2150 
ATTTACAGTC ATTCAGCTGG TGGGCATGCT TCGTGGCATT GGGTCTGGGA 2200 
TGAAGTATTT ATCTGATATG AGCTATGTGC ATCGTGATCT GGCCGCACGG 2250 
AACATCCTGG TGAACAGCAA CTTGGTCTGC AAAGTGTCTG ATTTTGGCAT 2300 
GTCCCGAGTG CTTGAGGATG ATCCGGAAGC AGCTTACACC ACCAGGGGTG 2350 
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GCAAGATTCC TATCCGGTGG ACTGCGCCAG AAGCAATTGC CTATCGTAAA 2400 

TTCACATCAG CAAGTGATGT ATGGAGCTAT GGAATCGTTA TGTGGGAAGT 2450 

GATGTCGTAC GGGGAGAGGC CCTATTGGGA TATGTCCAAT CAAGATGTGA 2500 

TTAAAGCCAT TGAGGAAGGC TATCGGTTAC CCCCTCCAAT GGACTGCCCC 2550 

5 ATTGCGCTCC ACCAGCTGAT GCTAGACTGC TGGCAGAAGG AGAGGAGCGA 2600 

CAGGCCTAAA TTTGGGCAGA TTGTCAACAT GTTGGACAAA CTCATCCGCA 2650 

ACCCCAACAG CTTGAAGAGG ACAGGGACGG AGAGCTCCAG ACCTAACACT 2700 

GCCTTGTTGG ATCCAAGCTC CCCTGAATTC TCTGCTGTGG TATCAGTGGG 2750 

CGATTGGCTC CAGGCCATTA AAATGGACCG GTATAAGGAT AACTTCACAG 2800 

10 CTGCTGGTTA TACCACACTA GAGGCTGTGG TGCACGTGAA CCAGGAGGAC 2850 

CTGGCAAGAA TTGGTATCAC AGCCATCACA CACCAGAATA AGATTTTGAG 2900 

CAGTGTCCAG GCAATGCGAA CCCAAATGCA GCAGATGCAC GGCAGAATGG 2950 

TTCCCGTCTG AGCCAGTACT GAATAAACTC AAAACTCTTG AAATTAGTTT 3000 

ACCTCATCCA TGCACTTTAA TTGAAGAACT GCACTTTTTT TACTTCGTCT 3050 

15 TCGCCCTCTG AAATTAAAGA AATGAAAAAA AAAAAACAAT ATCTGCAGCG 3100 

TTGCTTGGTG CACAGATTGC TGAAACTGTG GGGCTTACAG AAATGACTGC 3150 

CGGTCATTTG AATGAGACCT GGAACAAATC GTTTCTCAGA AGTACTTTTC 3200 

TGTTCATCAC CAGTCTGTAA AATACATGTA CCTATAGAAA TAGAACACTG 3250 

CCTCTGAGTT TTGATGCTGT ATTTGCTGCC AGACACTGAG CTTCTGAGAC 3300 
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ATCCCTGATT CTCTCTCCAT TTGGAATTAC AACGGTCGAC GAGCTCGA 3348 

(2) INFORMATION FOR SEQ ID NO:35: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 3348 bases 
5 (B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 35: 
TACCGACCCT AAAAGATAAA GCGGGATAAA AGCACAGAGA AGCCCTAAAC 50 
10 GCTGCGACAG TGTCCAAGGT CCCATATGGG GCGCTTACTT CAATGGAATA 100 
ACCTAAGGTC TAGACAAGTC CCTCTTGAAC CCACCTATCG TTCGGGAGAC 150 
CTTCCTCCCA CCCTCCTTCA CTCATAGTAC CTACTTTTTT TATGTGGTTA 200 
GGCTTGGATG GTTCACACGT TACACTACCT TGGGTCGGTC TTATTGACCG 250 
ATGCTTGACT AACCTAGTGG GCTCTTCCCC GAGTCTCCCA CATATAACTC 300 
15 TAATTTAAGT GGAACTCCCT GACGTTATCA GAAGGCCCGC AGTACCCCTG 350 
AACGTTCCTC TGCAAATTGG ACATGATGAT ACTTAGTCTG TTGCTGTTTC 400 
TCGCAAAGTA GTCTCTCTTG GTCAAACAGT TTTAACTGTG GTAACGACGA 450 
CTACTCTCGA AGTGGGTTCA CCTGTAACCA CTGTCTTAGT ACTTCGACTT 500 
GTGGCTCTAG GCCCTACATC CCGGTAATTC GTTTTTCCCC AAAATGGACC 550 
20 GAAAAGTCCT ACACCCCCGG ACGTAGCGGG ACCATAGTCA GGCACACAAG 600 
ATATTTTTCA CAGGTGAGTG TCAGGCGTTA GACCGGGTCA AAGGACTGTG 650 
GTAGTGTCCC CGACTATGCA GAAGGGACCA CCTTCAAGCT CCGAGGACAC 700 
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AGTTGTTGAG TCTTCTCTTT CTACACGGTT TTTACATGAC ACCCCGTCTA 750 

CCACTTACCG ACCATGGGTA ACCGTTGACG GATACGTTGC GACCCGTACT 800 

CCTCGCCTCG CCTCTTACGG TTCGAACGTT TTAACCTATA ATGTTCCGAG 850 

AGAGGTGCCT ACGGTGGACA CGGTTCACGG GTGGGGTGTC GATGAGACAG 900 

5 ACCCTTCCTC GGTGGAGCAC GTGGACACTG GCTCCGAAAA AGTCTCGACT 950 

GTTGCTACGA CGGAGATACG GGACGTGGGC AGGTGGTAGA CGAGGGGACT 1000 

TGAACTAAAG TTTACAGTTG CTCTGTAGAC ACTTGAACCT TACCTCATCG 1050 

GGAGTCTTAT GTCCACCGGC GGTCCTGTAA AGGATATTAC ACCATACGTT 1100 

CTTTACACCT CGACCACTGG GGTCGTTCAC GGCTGGGACA CCTTCACCCC 1150 

10 AGGTGATGTG GGGTGTCGTC TTACCGAACT TCTGGTGGTT TCCGAGGTAG 1200 

TGACTGGAGG ATCGAGTATG GTTAATGTGG AAACTTTAGA CCCGACACTT 1250 

ACCTCACAGG TTTATATTGG GATTGGGTCT GGTTAGTCAA AGACAGTGAC 1300 

ACTGGTGGTT GGTTCGTCGT GGTAGTAGGT AACGAAACCA GGTCCGATTT 1350 

CTTCAGTGTT CTATGTCACA CCGTGACCGA ACCGACCTTG GTCTAGCCGG 1400 

15 GTTACCCCAT TAGGACCTTA TACTTCAGTT CATAATACTC TTCCTAGTCT 1450 

TACTCGCTTC GATAGCATAT CAAGCCTGTC GACGGTCCTT GTGTCTATAG 1500 

TTTCCGGACT TGGGAGAGTG AAGGATACAA AAGGTGCACG CTCGGTCCTG 1550 

TCGTCGACCG ATACCTCTGA AGTCACTCGG GAACCTCCAA TGTTGGTTGT 1600 

GTCACGGAAG GGCCTAGTAA CCTCTACCCC GATTGAGGTG TCAGGAAGAC 1650 
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CAGAGACAGA GCCCGTCACA CCACGACCAC CATTAAGAGT AACGTCGAAA 1700 
ACAGTAGTCG GCCTCTGCCT CATTTATGTC ATTTCGGTTT GTTCTTCGCC 1750 
TACTTCTCTT TGTAAACTTA GTTCCACATT CTTGTATACA CCTGGGGAAA 1800 
TGCATGCTTC TAGGGTTGGT TCGTCACGCT CTCAAACGGT TTCTTTAACT 1850 
5 GCGTAGGACG TAATTCTAAC TTTTTCAATA TCCTCAACCA CTTAAACCAC 1900 
TCCATACGTC ACCCGCAGAG TTTCACGGAC CGTTCTCTCT CTAGACACAC 1950 
CGATAGTTCT GAGACTTTCG ACCAATATGT CTGTTTGTCT CCTCTCTGAA 2000 
GGACTCACTC CGGTCGTAGT ACCCTGTCAA ACTGGTAGGC TTGTAGTAAG 2050 
TGAACCTTCC GCACCAGTGA TTTACATTTG GTCATTACTA GTATTGTCTC 2100 
10 ATGTACCTCT TACCGAGGAA CCTACGTAAG GAGTCCTTTT TACTACCGTC 2150 
TAAATGTCAG TAAGTCGACC ACCCGTACGA AGCACCGTAA CCCAGACCCT 2200 
ACTTCATAAA TAGACTATAC TCGATACACG TAGCACTAGA CCGGCGTGCC 2250 
TTGTAGGACC ACTTGTCGTT GAACCAGACG TTTCACAGAC TAAAACCGTA 2300 
CAGGGCTCAC GAACTCCTAC TAGGCCTTCG TCGAATGTGG TGGTCCCCAC 2350 
15 CGTTCTAAGG ATAGGCCACC TGACGCGGTC TTCGTTAACG GATAGCATTT 2400 
AAGTGTAGTC GTTCACTACA TACCTCGATA CCTTAGCAAT ACACCCTTCA 2450 
CTACAGCATG CCCCTCTCCG GGATAACCCT ATACAGGTTA GTTCTACACT 2500 
AATTTCGGTA ACTCCTTCCG ATAGCCAATG GGGGAGGTTA CCTGACGGGG 2550 

TAACGCGAGG TGGTCGACTA CGATCTGACG ACCGTCTTCC TCTCCTCGCT 2600 

> 
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GTCCGGATTT AAACCCGTCT AACAGTTGTA CAACCTGTTT GAGTAGGCGT 2650 
TGGGGTTGTC GAACTTCTCC TGTCCCTGCC TCTCGAGGTC TGGATTGTGA 2700 
CGGAACAACC TAGGTTCGAG GGGACTTAAG AGACGACACC ATAGTCACCC 2750 
GCTAACCGAG GTCCGGTAAT TTTACCTGGC CATATTCCTA TTGAAGTGTC 2800 
5 GACGACCAAT ATGGTGTGAT CTCCGACACC ACGTGCACTT GGTCCTCCTG 2850 
GACCGTTCTT AACCATAGTG TCGGTAGTGT GTGGTCTTAT TCTAAAACTC 2900 
GTCACAGGTC CGTTACGCTT GGGTTTACGT CGTCTACGTG CCGTCTTACC 2950 
AAGGGCAGAC TCGGTCATGA CTTATTTGAG TTTTGAGAAC TTTAATCAAA 3000 
TGGAGTAGGT ACGTGAAATT AACTTCTTGA CGTGAAAAAA ATGAAGCAGA 3050 
10 AGCGGGAGAC TTTAATTTCT TTACTTTTTT TTTTTTGTTA TAGACGTCGC 3100 
AACGAACCAC GTGTCTAACG ACTTTGACAC CCCGAATGTC TTTACTGACG 3150 
GCCAGTAAAC TTACTCTGGA CCTTGTTTAG CAAAGAGTCT TCATGAAAAG 3200 
ACAAGTAGTG GTCAGACATT TTATGTACAT GGATATCTTT ATCTTGTGAC 3250 
GGAGACTCAA AACTACGACA TAAACGACGG TCTGTGACTC GAAGACTCTG 3300 
15 TAGGGACTAA GAGAGAGGTA AACCTTAATG TTGCCAGCTG CTCGAGCT 3348 

(2) INFORMATION FOR SEQ ID NO: 36: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1104 amino acids 

(B) TYPE: amino acid 
20 (D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 36: 

Met Ala Gly He Phe Tyx Phe Ala Leu Phe Ser Cys Leu Phe Gly 
1 5 10 15 
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lie Cys Asp Ala Val Thr Gly Ser Arg Val Tyr Pro Ala Asn Glu 
20 25 30 

Val Thr Leu Leu Asp Ser Arg Ser Val Gin Gly Glu Leu Gly Trp 
35 40 45 

5 lie Ala Ser Pro Leu Glu Gly Gly Trp Glu Glu Val Ser He Met 

50 55 60 

Asp Glu Lys Asn Thr Pro He Arg Thr Tyr Gin Val Cys Asn Val 
65 70 75 

Met Glu Pro Ser Gin Asn Asn Trp Leu Arg Thr Asp Trp He Thr 
10 80 85 90 

Arg Glu Gly Ala Gin Arg Val Tyr He Glu He Lys Phe Thr Leu 
95 100 105 

Arg Asp Cys Asn Ser Leu Pro Gly Val Met Gly Thr Cys Lys Glu 
110 115 120 

15 Thr Phe Asn Leu Tyr Tyr Tyr Glu Ser Asp Asn Asp Lys Glu Arg 

125 130 135 

Phe He Arg Glu Asn Gin Phe Val Lys lie Asp Thr He Ala Ala 
140 145 150 

Asp Glu Ser Phe Thr Gin Val Asp He Gly Asp Arg He Met Lys 
20 155 160 165 

Leu Asn Thr Glu He Arg Asp Val Gly Pro Leu Ser Lys Lys Gly 
170 175 180 

Phe Tyr Leu Ala Phe Gin Asp Val Gly Ala Cys He Ala Leu Val 
185 190 195 

25 Ser Val Arg Val Phe Tyr Lys Lys Cys Pro Leu Thr Val Arg Asn 

200 205 210 

Leu Ala Gin Phe Pro Asp Thr lie Thr Gly Ala Asp Thr Ser Ser 
215 220 225 

Leu Val Glu Val Arg Gly Ser Cys Val Asn Asn Ser Glu Glu Lys 
30 230 235 240 

Asp Val Pro Lys Met Tyr Cys Gly Ala Asp Gly Glu Trp Leu Val 
245 250 255 

Pro He Gly Asn Cys Leu Cys Asn Ala Gly His Glu Glu Arg Ser 
260 265 270 

35 Gly Glu Cys Gin Ala Cys Lys He Gly Tyr Tyr Lys Ala Leu Ser 

275 280 285 

Thr Asp Ala Thr Cys Ala Lys Cys Pro Pro His Ser Tyr Ser Val 
290 295 300 
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Trp Glu Gly Ala Thr Ser Cys Thr Cys Asp Arg Gly Phe Phe Arg 
305 310 315 

Ala Asp Asn Asp Ala Ala Ser Met Pro Cys Thr Arg Pro Pro Ser 
320 325 330 

5 Ala Pro Leu Asn Leu lie Ser Asn Val Asn Glu Thr Ser Val Asn 
335 340 345 

Leu Glu Trp Ser Ser Pro Gin Asn Thr Gly Gly Arg Gin Asp He 
350 355 360 

Ser Tyr Asn Val Val Cys Lys Lys Cys Gly Ala Gly Asp Pro Ser 
10 365 370 375 

Lys Cys Arg Pro Cys Gly Ser Gly Val His Tyr Thr Pro Gin Gin 
380 385 390 

Asn Gly Leu Lys Thr Thr Lys Gly Ser He Thr Asp Leu Leu Ala 
395 400 405 

15 His Thr Asn Tyr Thr Phe Glu He Trp Ala Val Asn Gly Val Ser 

410 415 420 

Lys Tyr Asn Pro Asn Pro Asp Gin Ser Val Ser Val Thr Val Thr 
425 430 435 

Thr Asn Gin Ala Ala Pro Ser Ser He Ala Leu Val Gin Ala Lys 
20 440 445 450 

Glu Val Thr Arg Tyr Ser Val Ala Leu Ala Trp Leu Glu Pro Asp 
455 460 465 

Arg Pro Asn Gly Val He Leu Glu Tyr Glu Val Lys Tyr Tyr Glu 
470 475 480 

25 Lys Asp Gin Asn Glu Arg Ser Tyr Arg He Val Arg Thr Ala Ala 

485 490 495 

Arg Asn Thr Asp He Lys Gly Leu Asn Pro Leu Thr Ser Tyr Val 
500 505 510 

Phe His Val Arg Ala Arg Thr Ala Ala Gly Tyr Gly Asp Phe Ser 
30 515 520 525 

Glu Pro Leu Glu Val Thr Thr Asn Thr Val Pro Ser Arg He He 
530 535 540 

Gly Asp Gly Ala Asn Ser Thr Val Leu Leu Val Ser Val Ser Gly 
545 550 555 

35 Ser Val Val Leu Val Val He Leu He Ala Ala Phe Val He Ser 

560 565 570 

Arg Arg Arg Ser Lys Tyr Ser Lys Ala Lys Gin Glu Ala Asp Glu 
575 580 585 
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Glu Lys His Leu Asn Gin Gly Val Arg Thr Tyr Val Asp Pro Phe 
590 595 600 

Thr Tyr Glu Asp Pro Asn Gin Ala Val Arg Glu Phe Ala Lys Glu 
605 610 615 

5 He Asp Ala Ser Cys He Lys He Glu Lys Val He Gly Val Gly 
620 625 630 

Glu Phe Gly Glu Val Cys Ser Gly Arg Leu Lys Val Pro Gly Lys 
635 640 645 

Arg Glu He Cys Val Ala He Lys Thr Leu Lys Ala Gly Tyr Thr 
10 650 655 660 

Asp Lys Gin Arg Arg Asp Phe Leu Ser Glu Ala Ser He Met Gly 
665 670 675 

Gin Phe Asp His Pro Asn He He His Leu Glu Gly Val Val Thr 
680 685 690 

15 Lys Cys Lys Pro Val Met He He Thr Glu Tyr Met Glu Asn Gly 

695 700 705 

Ser Leu Asp Ala Phe Leu Arg Lys Asn Asp Gly Arg Phe Thr Val 
710 715 720 

He Gin Leu Val Gly Met Leu Arg Gly He Gly Ser Gly Met Lys 
20 725 730 735 

Tyr Leu Ser Asp Met Ser Tyr Val His Arg Asp Leu Ala Ala Arg 
740 745 750 

Asn He Leu Val Asn Ser Asn Leu Val Cys Lys Val Ser Asp Phe 
755 760 765 

25 Gly Met Ser Arg Val Leu Glu Asp Asp Pro Glu Ala Ala Tyr Thr 

770 775 780 

Thr Arg Gly Gly Lys He Pro He Arg Trp Thr Ala Pro Glu Ala 
785 790 795 

He Ala Tyr Arg Lys Phe Thr Ser Ala Ser Asp Val Trp Ser Tyr 
30 800 805 810 

Gly He Val Met Trp Glu Val Met Ser Tyr Gly Glu Arg Pro Tyr 
815 820 625 

Trp Asp Met Ser Asn Gin Asp Val He Lys Ala He Glu Glu Gly 
830 835 840 

35 Tyr Arg Leu Pro Pro Pro Met Asp Cys Pro lie Ala Leu His Gin 

845 850 855 

Leu Met Leu Asp Cys Trp Gin Lys Glu Arg Ser Asp Arg Pro Lys 
860 865 870 
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Phe Gly Gin lie Val Asn Met Leu Asp Lys Leu lie Arg Asn Pro 
875 880 885 

Asn Ser Leu Lys Arg Thr Gly Thr Glu Ser Ser Arg Pro Asn Thr 
890 895 900 

5 Ala lieu Leu Asp Pro Ser Ser Pro Glu Phe Ser Ala Val Val Ser 
905 910 915 

Val Gly Asp Trp Leu Gin Ala lie Lys Met Asp Arg Tyr Lys Asp 
920 925 930 

Asn Phe Thr Ala Ala Gly Tyr Thr Thr Leu Glu Ala Val Val His 
10 935 940 945 

Val Asn Gin Glu Asp Leu Ala Arg He Gly He Thr Ala He Thr 
950 955 960 

His Gin Asn Lys He Leu Ser Ser Val Gin Ala Met Arg Thr Gin 
965 970 975 

15 Met Gin Gin Met His Gly Arg Met Val Pro Val Ala Ser Thr Glu 

980 985 990 

Thr Gin Asn Ser Asn Phe Thr Ser Ser Met His Phe Asn Arg Thr 
995 1000 1005 

Ala Leu Phe Leu Leu Arg Leu Arg Pro Leu Lys Leu Lys Lys Lys 
20 1010 1015 1020 

Lys Lys Asn Asn He Cys Ser Val Ala Trp Cys Thr Asp Cys Asn 
1025 1030 1035 

Cys Gly Ala Tyr Arg Asn Asp Cys Arg Ser Phe Glu Asp Leu Glu 
1040 1045 1050 

25 Gin He Val Ser Gin Lys Tyr Phe Ser Val His His Gin Ser Val 
1055 1060 1065 

Lys Tyr Met Tyr Leu Lys Asn Thr Ala Ser Glu Phe Cys Cys He 
1070 1075 1080 

Cys Cys Gin Thr Leu Ser Phe Asp He Pro Asp Ser Leu Ser He 
30 1085 1090 1095 

Trp Asn Tyr Asn Gly Arg Arg Ala Arg 
1100 1104 

(2) INFORMATION FOR SEQ ID NO:37: 

(i) SEQUENCE CHARACTERISTICS; 
35 (A) LENGTH: 24 bases 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 37: 
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TCGGATCCAC ACGNGACTCT TGGC 24 



(2) INFORMATION FOR SEQ ID NO: 38: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 28 bases 
5 (B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 38 



TCGGATCCAC TCAGNGACTC TTNGCNGC 28 



10 (2) INFORMATION FOR SEQ ID NO:39: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 bases 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
15 (D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 33 



CTCGAATTCC AGATAAGCGT ACCAGCACAG TC 32 



(2) INFORMATION FOR SEQ ID NO:40: 

(i) SEQUENCE CHARACTERISTICS: 
20 (A) LENGTH: 32 bases 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 40 



25 CTCGAATTCC AGATATCCGT ACCATAACAG TC 32 



(2) INFORMATION FOR SEQ ID NO:41: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13 amino acids 

(B) TYPE: amino acid 
30 (D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO:41: 

Met Asp Tyr Lys Asp Asp Asp Asp Lys Lys Leu Ala Met 
15 10 13 

(2) INFORMATION FOR SEQ ID NO: 42: 

5 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 54 bases 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

10 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 42: 



CCGGATATCA TGGACTACAA GGACGACGAT GACAAGAAGC TTGCCATGGA 50 



GCTC 54 



(2) INFORMATION FOR SEQ ID NO:43: 

(i) SEQUENCE CHARACTERISTICS: 
15 (A) LENGTH: 22 bases 

(B) TYPE : nucleic acid 
<C) STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(Xi) SEQUENCE DESCRIPTION : SEQ ID NO: 43: 



20 AGGCTGCTGG AGGAAAAGTC TG 22 



(2) INFORMATION FOR SEQ ID NO:44: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 bases 

(B) TYPE: nucleic acid 
25 (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 44: 



GGAGGGTGAC CTCCATGCTG CCCTTATCCT CG 32 



(2) INFORMATION FOR SEQ ID NO:45: 

30 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9108 bases 
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(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 45: 
5 TTCGAGCTCG CCCGACATTG ATTATTGACT AGTTATTAAT AGTAATCAAT 50 
TACGGGGTCA TTAGTTCATA GCCCATATAT GGAGTTCCGC GTTACATAAC 100 
TTACGGTAAA TGGCCCGCCT GGCTGACCGC CCAACGACCC CCGCCCATTG 150 
ACGTCAATAA TGACGTATGT TCCCATAGTA ACGCCAATAG GGACTTTCCA 200 
TTGACGTCAA TGGGTGGAGT ATTTACGGTA AACTGCCCAC TTGGCAGTAC 250 
10 ATCAAGTGTA TCATATGCCA AGTACGCCCC CTATTGACGT CAATGACGGT 300 
AAATGGCCCG CCTGGCATTA TGCCCAGTAC ATGACCTTAT GGGACTTTCC 350 
. TACTTGGCAG TACATCTACG TATTAGTCAT CGCTATTACC ATGGTGATGC 400 
GGTTTTGGCA GTACATCAAT GGGCGTGGAT AGCGGTTTGA CTCACGGGGA 450 
TTTCCAAGTC TCCACCCCAT TGACGTCAAT GGGAGTTTGT TTTGGCACCA 500 
15 AAATCAACGG GACTTTCCAA AATGTCGTAA CAACTCCGCC CCATTGACGC 550 
AAATGGGCGG TAGGCGTGTA CGGTGGGAGG TCTATATAAG CAGAGCTCGT 600 . 
TTAGTGAACC GTCAGATCGC CTGGAGACGC CATCCACGCT GTTTTGACCT 650 
CCATAGAAGA CACCGGGACC GATCCAGCCT CCGCGGCCGG GAACGGTGCA 700 
TTGGAACGCG GATTCCCCGT GCCAAGAGTG ACGTAAGTAC CGCCTATAGA 750 
20 GTCTATAGGC CCACCCCCTT GGCTTCGTTA GAACGCGGCT ACAATTAATA 800 



CATAACCTTA TGTATCATAC ACATACGATT TAGGTGACAC TATAGAATAA 850 



* 
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CATCCACTTT GCCTTTCTCT CCACAGGTGT CCACTCCCAG GTCCAACTGC 900 

ACCTCGGTTC TATCGATTGA ATTCGCGGCC GCTCGGGTCG GACCCACGCG 950 

CAGCGGCCGG AGATGCAGCG GGGCGCCGCG CTGTGCCTGC GACTGTGGCT 1000 

CTGCCTGGGA CTCCTGGACG GCCTGGTGAG TGGCTACTCC ATGACCCCCC 1050 

5 CGACCTTGAA CATCACGGAG GAGTCACACG TCATCGACAC CGGTGACAGC 1100 

CTGTCCATCT CCTGCAGGGG ACAGCACCCC CTCGAGTGGG CTTGGCCAGG 1150 

AGCTCAGGAG GCGCCAGCCA CCGGAGACAA GGACAGCGAG GACACGGGGG 1200 

TGGTGCGAGA CTGCGAGGGC ACAGACGCCA GGCCCTACTG CAAGGTGTTG 1250 

CTGCTGCACG AGGTACATGC CAACGACACA GGCAGCTACG TCTGCTACTA 1300 

10 CAAGTACATC AAGGCACGCA TCGAGGGCAC CACGGCCGCC AGCTCCTACG 1350 
TGTTCGTGAG AGACTTTGAG CAGCCATTCA TCAACAAGCC TGACACGCTC 1400 
TTGGTCAACA GGAAGGACGC CATGTGGGTG CCCTGTCTGG TGTCCATCCC 1450 
CGGCCTCAAT GTCACGCTGC GCTCGCAAAG CTCGGTGCTG TGGCCAGACG 1500 
GGCAGGAGGT GGTGTGGGAT GACCGGCGGG GCATGCTCGT GTCCACGCCA 1550 

15 CTGCTGCACG ATGCCCTGTA CCTGCAGTGC GAGACCACCT GGGGAGACCA 1600 
GGACTTCCTT TCCAACCCCT TCCTGGTGCA CATCACAGGC AACGAGCTCT 1650 
ATGACATCCA GCTGTTGCCC AGGAAGTCGC TGGAGCTGCT GGTAGGGGAG 1700 
AAGCTGGTCC TGAACTGCAC CGTGTGGGCT GAGTTTAACT CAGGTGTCAC 1750 
CTTTGACTGG GACTACCCAG GGAAGCAGGC AGAGCGGGGT AAGTGGGTGC 1800 
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CCGAGCGACG CTCCCAGCAG ACCCACACAG AACTCTCCAG CATCCTGACC 1850 
ATCCACAACG TCAGCCAGCA CGACCTGGGC TCGTATGTGT GCAAGGCCAA 1900 
CAACGGCATC CAGCGATTTC GGGAGAGCAC CGAGGTCATT GTGCATGAAA 1950 
ATCCCTTCAT CAGCGTCGAG TGGCTCAAAG GACCCATCCT GGAGGCCACG 2000 
5 GCAGGAGACG AGCTGGTGAA GCTGCCCGTG AAGCTGGCAG CGTACCCCCC 2050 
GCCCGAGTTC CAGTGGTACA AGGATGGAAA GGCACTGTCC GGGCGCCACA 2100 
GTCCACATGC CCTGGTGCTC AAGGAGGTGA CAGAGGCCAG CACAGGCACC 2150 
TACACCCTCG CCCTGTGGAA CTCCGCTGCT GGCCTGAGGC GCAACATCAG 2200 
CCTGGAGCTG GTGGTGAATG TGCCCCCCCA GATACATGAG AAGGAGGCCT 2250 
10 CCTCCCCCAG CATCTACTCG CGTCACAGCC GCCAGGCCCT CACCTGCACG 2300 
GCCTACGGGG TGCCCCTGCC TCTCAGCATC CAGTGGCACT GGCGGCCCTG 2350 
GACACCCTGC AAGATGTTTG CCCAGCGTAG TCTCCGGCGG CGGCAGCAGC 2400 
AAGACCTCAT GCCACAGTGC CGTGACTGGA GGGCGGTGAC CACGCAGGAT 2450 
GCCGTGAACC CCATCGAGAG CCTGGACACC TGGACCGAGT TTGTGGAGGG 2500 
15 AAAGAATAAG ACTGTGAGCA AGCTGGTGAT CCAGAATGCC AACGTGTCTG 2550 
CCATGTACAA GTGTGTGGTC TCCAACAAGG TGGGCCAGGA TGAGCGGCTC 2600 
ATCTACTTCT ATGTGACCAC CATCCCCGAC GGCTTCACCA TCGAATCCAA 2650 
GCCATCCGAG GAGCTACTAG AGGGCCAGCC GGTGCTCCTG AGCTGCCAAG 2700 



CCGACAGCTA CAAGTACGAG CATCTGCGCT GGTACCGCCT CAACCTGTCC 2750 
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ACGCTGCACG ATGCGCACGQ GAACCCGCTT CTGCTCGACT GCAAGAACGT 2800 
GCATCTGTTC GCCACCCCTC TGGCCGCCAG CCTGGAGGAG GTGGCACCTG 2850 
GGGCGCGCCA CGCCACGCTC AGCCTGAGTA TCCCCCGCGT CGCGCCCGAG 2900 
CACGAGGGCC ACTATGTGTG CGAAGTGCAA GACCGGCGCA GCCATGACAA 2950 
5 GCACTGCCAC AAGAAGTACC TGTCGGTGCA GGCCCTGGAA GCCCCTCGGC 3000 
TCACGCAGAA CTTGACCGAC CTCCTGGTGA ACGTGAGCGA CTCGCTGGAG 3050 
ATGCAGTGCT TGGTGGCCGG AGCGCACGCG CCCAGCATCG TGTGGTACAA 3100 
AGACGAGAGG CTGCTGGAGG AAAAGTCTGG AGTCGACTTG GCGGACTCCA 3150 
ACCAGAAGCT GAGCATCCAG CGCGTGCGCG AGGAGGATGC GGGACGCTAT 3200 
10 CTGTGCAGCG TGTGCAACGC CAAGGGCTGC GTCAACTCCT CCGCCAGCGT 3250 
GGCCGTGGAA GGCTCCGAGG ATAAGGGCAG CATGGAGATC GTGATCCTTG 3300 
TCGGTACCGG CGTCATCGCT GTCTTCTTCT GGGTCCTCCT CCTCCTCATC 3350 
TTCTGTAACA TGAGGAGGCC GGCCCACGCA GACATCAAGA CGGGCTACCT 3400 
GTCCATCATC ATGGACCCCG GGGAGGTGCC TCTGGAGGAG CAATGCGAAT 3450 
15 ACCTGTCCTA CGATGCCAGC CAGTGGGAAT TCCCCCGAGA GCGGCTGCAC 3500 
CTGGGGAGAG TGCTCGGCTA CGGCGCCTTC GGGAAGGTGG TGGAAGCCTC 3550 
CGCTTTCGGC ATCCACAAGG GCAGCAGCTG TGACACCGTG GCCGTGAAAA 3600 
TGCTGAAAGA GGGCGCCACG GCCAGCGAGC ACCGCGCGCT GATGTCGGAG 3650 
CTCAAGATCC TCATTCACAT CGGCAACCAC CTCAACGTGG TCAACCTCCT 3700 
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CGGGGCGTGC ACCAA6CCGC AGGGCCCCCT CATGGTGATC GTGGAGTTCT 3750 
GCAAGTACGG CAACCTCTCC AACTTCCTGC GCGCCAAGCG GGACGCCTTC 3800 
AGCCCCTGCG CGGAGAAGTC TCCCGAGCAG CGCGGACGCT TCCGCGCCAT 3850 
GGTGGAGCTC GCCAGGCTGG ATCGGAGGCG GCCGGGGAGC AGCGACAGGG 3900 
5 TCCTCTTCGC GCGGTTCTCG AAGACCGAGG GCGGAGCGAG GCGGGCTTCT 3950 
CCAGACCAAG AAGCTGAGGA CCTGTGGCTG AGCCCGCTGA CCATGGAAGA 4000 
TCTTGTCTGC TACAGCTTCC AGGTGGCCAG AGGGATGGAG TTCCTGGCTT 4050 
CCCGAAAGTG CATCCACAGA GACCTGGCTG CTCGGAACAT TCTGCTGTCG 4100 
GAAAGCGACG TGGTGAAGAT CTGTGACTTT GGCCTTGCCC GGGACATCTA 4150 
10 CAAAGACCCT GACTACGTCC GCAAGGGCAG TGCCCGGCTG CCCCTGAAGT 4200 
GGATGGCCCC TGAAAGCATC TTCGACAAGG TGTACACCAC GCAGAGTGAC 4250 
GTGTGGTCCT TTGGGGTGCT TCTCTGGGAG ATCTTCTCTC TGGGGGCCTC 4300 
CCCGTACCCT GGGGTGCAGA TCAATGAGGA GTTCTGCCAG CGGCTGAGAG 4350 
ACGGCACAAG GATGAGGGCC CCGGAGCTGG CCACTCCCGC CATACGCCGC 4400 
15 ATCATGCTGA ACTGCTGGTC CGGAGACCCC AAGGCGAGAC CTGCATTCTC 4450 
GGAGCTGGTG GAGATCCTGG GGGACCTGCT CCAGGGCAGG GGCCTGCAAG 4500 
AGGAAGAGGA GGTCTGCATG GCCCCGCGCA GCTCTCAGAG CTCAGAAGAG 4550 
GGCAGCTTCT CGCAGGTGTC CACCATGGCC CTACACATCG CCCAGGCTGA 4600 



CGCTGAGGAC AGCCCGCCAA GCCTGCAGCG CCACAGCCTG GCCGCCAGGT 4650 
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ATTACAACTG GGTGTCCTTT CCCGGGTGCC TGGCCAGAGG GGCTGAGACC 4700 
CGTGGTTCCT CCAGGATGAA GACATTTGAG GAATTCCCCA TGACCCCAAC 4750 
GACCTACAAA GGCTCTGTGG ACAACCAGAC AGACAGTGGG ATGGTGCTGG 4800 
CCTCGGAGGA GTTTGAGCAG ATAGAGAGCA GGCATAGACA AGAAAGCGGC 4850 



5 TTCAGGTAGC TGAAGCAGAG AGAGAGAAGG CAGCATACGT CAGCATTTTC 4900 
TTCTCTGCAC TTATAAGAAA GATCAAAGAC TTTAAGACTT TCGCTATTTC 4950 
TTCTGCTATC TACTACAAAC TTCAAAGAGG AACCAGGAGG CCAAGAGGAG 5000 
CATGAAAGTG GACAAGGAGT GTGACCACTG AAGCACCACA GGGAGGGGTT 5050 
AGGCCTCCGG ATGACTGCGG GCAGGCCTGG ATAATATCCA GCCTCCCACA 5100 
10 AGAAGCTGGT GGAGCAGAGT GTTCCCTGAC TCCTCCAAGG AAAGGGAGAC 5150 
GCCCTTTCAT GGTCTGCTGA GTAACAGGTG CCTTCCCAGA CACTGGCGTT 5200 
ACTGCTTGAC CAAAGAGCCC TCAAGCGGCC CTTATGCCAG CGTGACAGAG 5250 
GGCTCACCTC TTGCCTTCTA GGTCACTTCT CACAATGTCC CTTCAGCACC 5300 
TGACCCTGTG CCCGCCAGTT ATTCCTTGGT AATATGAGTA ATACATCAAA 5350 
15 GAGTAGTGCG GCCGCGAATT CCCCGGGGAT CCTCTAGAGT CGACCTGCAG 5400 
AAGCTTGGCC GCCATGGCCC AACTTGTTTA TTGCAGCTTA TAATGGTTAC 5450 
AAATAAAGCA ATAGCATCAC AAATTTCACA AATAAAGCAT TTTTTTCACT 5500 
GCATTCTAGT TGTGGTTTGT CCAAACTCAT CAATGTATCT TATCATGTCT 5550 
GGATCGGGAA TTAATTCGGC GCAGCACCAT GGCCTGAAAT AACCTCTGAA 5600 
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AGAGGAACTT GGTTAGGTAC CTTCTGAGGC GGAAAGAACC AGCTGTGGAA 5650 

TGTGTGTCAG TTAGGGTGTG GAAAGTCCCC AGGCTCCCCA GCAGGCAGAA 5700 

GTATGCAAAG CATGCATCTC AATTAGTCAG CAACCAGGTG TGGAAAGTCC 5750 

CCAGGCTCCC CAGCAGGCAG AAGTATGCAA AGCATGCATC TCAATTAGTC 5800 

5 AGCAACCATA GTCCCGCCCC TAACTCCGCC CATCCCGCCC CTAACTCCGC 5850 

CCAGTTCCGC CCATTCTCCG CCCCATGGCT GACTAATTTT TTTTATTTAT 5900 

GCAGAGGCCG AGGCCGCCTC GGCCTCTGAG CTATTCCAGA AGTAGTGAGG 5950 

AGGCTTTTTT GGAGGCCTAG GCTTTTGCAA AAAGCTGTTA ACAGCTTGGC 6000 

ACTGGCCGTC GTTTTACAAC GTCGTGACTG GGAAAACCCT GGCGTTACCC 6050 

10 AACTTAATCG CCTTGCAGCA CATCCCCCTT TCGCCAGCTG GCGTAATAGC 6100 

GAAGAGGCCC GCACCGATCG CCCTTCCCAA CAGTTGCGCA GCCTGAATGG 6150 

CGAATGGCGC CTGATGCGGT ATTTTCTCCT TACGCATCTG TGCGGTATTT 6200 

CACACCGCAT ACGTCAAAGC AACCATAGTA CGCGCCCTGT AGCGGCGCAT 6250 

TAAGCGCGGC GGGTGTGGTG GTTACGCGCA GCGTGACCGC TACACTTGCC 6300 

15 AGCGCCCTAG CGCCCGCTCC TTTCGCTTTC TTCCCTTCCT TTCTCGCCAC 6350 

GTTCGCCGGC TTTCCCCGTC AAGCTCTAAA TCGGGGGCTC CCTTTAGGGT 6400 

TCCGATTTAG TGCTTTACGG CACCTCGACC CCAAAAAACT TGATTTGGGT 6450 

GATGGTTCAC GTAGTGGGCC ATCGCCCTGA TAGACGGTTT TTCGCCCTTT 6500 

GACGTTGGAG TCCACGTTCT TTAATAGTGG ACTCTTGTTC CAAACTGGAA 6550 
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CAACACTCAA CCCTATCTCG GGCTATTCTT TTGATTTATA AGGGATTTTG 6600 

CCGATTTCGG CCTATTGGTT AAAAAATGAG CTGATTTAAC AAAAATTTAA 6650 

CGCGAATTTT AACAAAATAT TAACGTTTAC AATTTTATGG TGCACTCTCA 6700 

GTACAATCTG CTCTGATGCC GCATAGTTAA GCCAGCCCCG ACACCCGCCA 6750 

5 ACACCCGCTG ACGCGCCCTG ACGGGCTTGT CTGCTCCCGG CATCCGCTTA 6800 

CAGACAAGCT GTGACCGTCT CCGGGAGCTG CATGTGTCAG AGGTTTTCAC 6850 

CX3TCATCACC GAAACGCGCG AGACGAAAGG GCCTCGTGAT ACGCCTATTT 6900 

TTATAGGTTA ATGTCATGAT AATAATGGTT TCTTAGACGT CAGGTGGCAC 6950 

TTTTCGGGGA AATGTGCGCG GAACCCCTAT TTGTTTATTT TTCTAAATAC 7000 

10 ATTCAAATAT GTATCCGCTC ATGAGACAAT AACCCTGATA AATGCTTCAA 7050 
TAATATTGAA AAAGGAAGAG TATGAGTATT CAACATTTCC GTGTCGCCCT 7100 
TATTCCCTTT TTTGCGGCAT TTTGCCTTCC TGTTTTTGCT CACCCAGAAA 7150 
CGCTGGTGAA AGTAAAAGAT GCTGAAGATC AGTTGGGTGC ACGAGTGGGT 7200 
TACATCGAAC TGGATCTCAA CAGCGGTAAG ATCCTTGAGA GTTTTCGCCC 7250 

15 CGAAGAACGT TTTCCAATGA TGAGCACTTT TAAAGTTCTG CTATGTGGCG 7300 
CGGTATTATC CCGTATTGAC GCCGGGCAAG AGCAACTCGG TCGCCGCATA 7350 
CACTATTCTC AGAATGACTT GGTTGAGTAC TCACCAGTCA CAGAAAAGCA 7400 
TCTTACGGAT GGCATGACAG TAAGAGAATT ATGCAGTGCT GCCATAACCA 7450 
TGAGTGATAA CACTGCGGCC AACTTACTTC TGACAACGAT CGGAGGACCG 7500 
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AAGGAGCTAA CCGCTTTTTT GCACAACATG GGGGATCATG TAACTCGCCT 7550 
TGATCGTTGG GAACCGGAGC TGAATGAAGC CATACCAAAC GACGAGCGTG 7600 
ACACCACGAT GCCTGTAGCA ATGGCAACAA CGTTGCGCAA ACTATTAACT 7650 
GGCGAACTAC TTACTCTAGC TTCCCGGCAA CAATTAATAG ACTGGATGGA 7700 
5 GGCGGATAAA GTTGCAGGAC CACTTCTGCG CTCGGCCCTT CCGGCTGGCT 7750 
GGTTTATTGC TGATAAATCT GGAGCCGGTG AGCGTGGGTC TCGCGGTATC 7800 
ATTGCAGCAC TGGGGCCAGA TGGTAAGCCC TCCCGTATCG TAGTTATCTA 7850 
CACGACGGGG AGTCAGGCAA CTATGGATGA ACGAAATAGA CAGATCGCTG 7900 
AGATAGGTGC CTCACTGATT AAGCATTGGT AACTGTCAGA CCAAGTTTAC 7950 

10 TCATATATAC TTTAGATTGA TTTAAAACTT CATTTTTAAT TTAAAAGGAT 8000 
CTAGGTGAAG ATCCTTTTTG ATAATCTCAT GACCAAAATC CCTTAACGTG 8050 
AGTTTTCGTT cCACTGAGCG TCAGACCCCG TAGAAAAGAT CAAAGGATCT 8100 
TCTTGAGATC CTTTTTTTCT GCGCGTAATC TGCTGCTTGC AAACAAAAAA 8150 
ACCACCGCTA CCAGCGGTGG TTTGTTTGCC GGATCAAGAG CTACCAACTC 8200 

15 TTTTTCCGAA GGTAACTGGC TTCAGCAGAG CGCAGATACC AAATACTGTT 8250 
CTTCTAGTGT AGCCGTAGTT AGGCCACCAC TTCAAGAACT CTGTAGCACC 8300 
GCCTACATAC CTCGCTCTGC TAATCCTGTT ACCAGTGGCT GCTGCCAGTG 8350 
GCGATAAGTC GTGTCTTACC GGGTTGGACT CAAGACGATA GTTACCGGAT 8400 
AAGGCGCAGC GGTCGGGCTG AACGGGGGGT TCGTGCACAC AGCCCAGCTT 8450 
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GGAGCGAACG ACCTACACCG AACTGAGATA CCTACAGCGT GAGCTATGAG 8500 

AAAGCGCCAC GCTTCCCGAA GGGAGAAAGG CGGACAGGTA TCCGGTAAGC 8550 

GGCAGGGTCG GAACAGGAGA GCGCACGAGG GAGCTTCCAG GGGGAAACGC 8600 

CTGGTATCTT TATAGTCCTG TCGGGTTTCG CCACCTCTGA CTTGAGCGTC 8650 

5 GATTTTTGTG ATGCTCGTCA GGGGGGCGGA GCCTATGGAA AAACGCCAGC 8700 

AACGCGGCCT TTTTACGGTT CCTGGCCTTT TGCTGGCCTT TTGCTCACAT 8750 

GTTCTTTCCT GCGTTATCCC CTGATTCTGT GGATAACCGT ATTACCGCCT 8800 

TTGAGTGAGC TGATACCGCT CGCCGCAGCC GAACGACCGA GCGCAGCGAG 8850 

TCAGTGAGCG AGGAAGCGGA AGAGCGCCCA ATACGCAAAC CGCCTCTCCC 8900 

10 CGCGCGTTGG CCGATTCATT AATGCAGCTG GCACGACAGG TTTCCCGACT 8950 

GGAAAGCGGG CAGTGAGCGC AACGCAATTA ATGTGAGTTA GCTCACTCAT 9000 

TAGGCACCCC AGGCTTTACA CTTTATGCTT CCGGCTCGTA TGTTGTGTGG 9050 

AATTGTGAGC GGATAACAAT TTCACACAGG AAACAGCTAT GACATGATTA 9100 

CGAATTAA 9108 
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The invention claimed is: 

1. An agonist antibody which activates the kinase domain of a 
receptor protein tyrosine kinase (pTK) selected from the group 
consisting of: 

5 a) SAL-SI; 

b) HpTK 5; and 

c) bpTK 7. 

2. The antibody of claim 1 comprising a monoclonal antibody. 

3. The antibody of claim 1 wherein the pTK is HpTK5. 

10 4. The antibody of claim 3 having the biological characteristics of 
the antibody produced by the hybridoma cell line deposited under 
American Type Culture Collection Accession No. ATCC HB 11,583. 

5. The antibody of claim 1 wherein the pTK is SAL-SI. 

6. A pharmaceutical composition comprising the antibody of claim 1 in 
15 an amount effective in activating the kinase domain of the 

receptor protein tyrosine kinase (pTK) , and a pharmaceutical^ 
acceptable carrier. 

7. A method for activating the kinase domain of a receptor protein 
tyrosine kinase (pTK) selected from the group consisting of: 

20 a) SAL-SI; 

b) HpTK 5; and 

c) bpTK 7, comprising contacting the pTK with an effective 
amount of an agonist antibody thereto. 

8. A chimeric protein comprising a fusion of the extracellular domain 
25 of a receptor protein tyrosine kinase (pTK) selected from the 

group consisting of: 

a) SAL-SI; 

b) HpTK 5; and 

c) bpTK 7, with an immunoglobulin constant domain sequence. 
30 9. The chimeric protein of claim 8 wherein the pTK is HpTKS . 

10. The chimeric protein of claim 8 wherein the pTK is Sal-Si. 

11. The chimeric protein of claim 8 wherein the immunoglobulin 
constant domain sequence is that of an IgG immunoglobulin. 

12. A nucleic acid encoding the chimeric protein of claim 8. 
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13. A replicable vector comprising the nucleic acid of claim 12. 

14. A recombinant host cell comprising the nucleic acid of claim 12. 

15. A method of using a nucleic acid molecule encoding a chimeric 
protein comprising a fusion of the extracellular domain of a receptor 

5 protein tyrosine kinase (pTK) selected from the group consisting of: 

a) SAL- SI; 

b) HpTK 5; and 

c) bpTK 7, with an immunoglobulin constant domain sequence, to 
effect the production of the chimeric protein comprising culturing the 

10 host cell of claim 14. 
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FIG. 15A 



1 TCGGGTCGGA CCCACGCGCA GCGGCCGGAG ATGCAGCGGG GCGCCGCGCT GTGCCTGCGA 
AGCCCAGCCT GGGTGCGCGT CGCCGGCCTC TACGTCGCCC CGCGGCGCGA CACGGACGCT 
1 MQRGAALCLR 

61 CTGTGGCTCT GCCTGGGACT CCTGGACGGC CTGGTGAGTG GCTACTCCAT GACCCCCCCG 
GACACCGAGA CGGACCCTGA GGACCTGCCG GACCACTCAC CGATGAGGTA CTGGGGGGGC 
11LWLC LGL LDG LVSG YSM TPP 

121 ACCTTGAACA TCACGGAGGA GTCACACGTC ATCGACACCG GTGACAGCCT GTCCATCTCC 
TGGAACTTGT AGTGCCTCCT CAGTGTGCAG TAGCTGTGGC CACTGTCGGA CAGGTAGAGG 
31TLNI TEE SHV IDTG DSL SIS 

181 TGCAGGGGAC AGCACCCCCT CGAGTGGGCT TGGCCAGGAG CTCAGGAGGC GCCAGCCACC 
ACGTCCCCTG TCGTGGGGGA GCTCACCCGA ACCGGTCCTC GAGTCCTCCG CGGTCGGTGG 
51CRGQ HPL EWA WPGA QEA PAT 

241 GGAGACAAGG ACAGCGAGGA CACGGGGGTG GTGCGAGACT GCGAGGGCAC AGACGCCAGG 
CCTCTGTTCC TGTCGCTCCT GTGCCCCCAC CACGCTCTGA CGCTCCCGTG TCTGCGGTCC 
71GDKD SED TGV VRDC EGT DAR 

301 CCCTACTGCA AGGTGTTGCT GCTGCACGAG GTACATGCCA ACGACACAGG CAGCTACGTC 
GGGATGACGT TCCACAACGA CGACGTGCTC CATGTACGGT TGCTGTGTCC GTCGATGCAG 
91PYCK VLL LHE VHAN DTG SYV 

361 TGCTACTACA AGTACATCAA GGCACGCATC GAGGGCACCA CGGCCGCCAG CTCCTACGTG 
ACGATGATGT TGATGTAGTT CCGTGCGTAG CTCCCGTGGT GCCGGCGGTC GAGGATGCAC 
111CYYK YIK A R I EGTT A A S SYV 

421 TTCGTGAGAG ACTTTGAGCA GCCATTCATC AACAAGCCTG ACACGCTCTT GGTCAACAGG 
AAGCACTCTC TGAAACTCGT CGGTAAGTAG TTGTTCGGAC TGTGCGAGAA CCAGTTGTCC 
131 F V R D FEQ PFI NKPD T L L VNR 

481 AAGGACGCCA TGTGGGTGCC CTGTCTGGTG TCCATCCCCG GCCTCAATGT CACGCTGCGC 
TTCCTGCGGT ACACCCACGG GACAGACCAC AGGTAGGGGC CGGAGTTACA GTGCGACGCG 
151KDAM WVP C L V SIPG LNV TLR 

541 TCGCAAAGCT CGGTGCTGTG GCCAGACGGG CAGGAGGTGG TGTGGGATGA CCGGCGGGGC 
AGCGTTTCGA GCCACGACAC CGGTCTGCCC GTCCTCCACC ACACCCTACT GGCCGCCCCG 
171SQSS VLW PDG QEVV WDD RRG 

601 ATGCTCGTGT CCACGCCACT GCTGCACGAT GCCCTGTACC TGCAGTGCGA GACCACCTGG 
TACGAGCACA GGTGCGGTGA CGACGTGCTA CGGGACATGG ACGTCACGCT CTGGTGGACC 
191MLVS TPL L H D A L Y L QCE TTW 

661 GGAGACCAGG ACTTCCTTTC CAACCCCTTC CTGGTGCACA TCACAGGCAA CGAGCTCTAT 
CCTCTGGTCC TGAAGGAAAG GTTGGGGAAG GACCACGTGT AGTGTCCGTT GCTCGAGATA 
211GDQD FLS NPF LVHI TGN ELY 
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FIG. 15B 



721 GACATCCAGC TGTTGCCCAG GAAGTCGCTG GAGCTGCTGG TAGGGGAGAA GCTGGTCCTG 
CTGTAGGTCG ACAACGGGTC CTTCAGCGAC CTCGACGACC ATCCCCTCTT CGACCAGGAC 
231DIQL LPR K S L E L L V GEK LVL 

781 AACTGCACCG TGTGGGCTGA GTTTAACTCA GGTGTCACCT TTGACTGGGA CTACCCAGGG 
TTGACGTGGC ACACCCGACT CAAATTGAGT CCACAGTGGA AACTGACCCT GATGGGTCCC 
251NCTV WAE FNS GVTF DWD YPG 

841 AAGCAGGCAG AGCGGGGTAA GTGGGTGCCC GAGCGACGCT CCCAGCAGAC CCACACAGAA 
TTCGTCCGTC TCGCCCCATT CACCCACGGG CTCGCTGCGA GGGTCGTCTG GGTGTGTCTT 
271 K Q A E RGK WVP ERRS QQT HTE 

901 CTCTCCAGCA TCCTGACCAT CCACAACGTC AGCCAGCACG ACCTGGGCTC GTATGTGTGC 
GAGAGGTCGT AGGACTGGTA GGTGTTGCAG TCGGTCGTGC TGGACCCGAG CATACACACG 
291LSSI L T I HNV SQHD LGS YVC 

961 AAGGCCAACA ACGGCATCCA GCGATTTCGG GAGAGCACCG AGGTCATTGT GCATGAAAAT 
TTCCGGTTGT TGCCGTAGGT CGCTAAAGCC CTCTCGTGGC TCCAGTAACA CGTACTTTTA 
311KANN GIQ RFR ESTE VIV HEN 

1021 CCCTTCATCA GCGTCGAGTG GCTCAAAGGA CCCATCCTGG AGGCCACGGC AGGAGACGAG 
GGGAAGTAGT CGCAGCTCAC CGAGTTTCCT GGGTAGGACC TCCGGTGCCG TCCTCTGCTC 
331PFIS VEW L K G PILE ATA GDE 

1081 CTGGTGAAGC TGCCCGTGAA GCTGGCAGCG TACCCCCCGC CCGAGTTCCA GTGGTACAAG 
GACCACTTCG ACGGGCACTT CGACCGTCGC ATGGGGGGCG GGCTCAAGGT CACCATGTTC 
351 L V K L PVK L A A YPPP EFQ WYK 

1141 GATGGAAAGG CACTGTCCGG GCGCCACAGT CCACATGCCC TGGTGCTCAA GGAGGTGACA 
CTACCTTTCC GTGACAGGCC CGCGGTGTCA GGTGTACGGG ACCACGAGTT CCTCCACTGT 
371DGKA L S G RHS PHAL VLK EVT 

1201 GAGGCCAGCA CAGGCACCTA CACCCTCGCC CTGTGGAACT CCGCTGCTGG CCTGAGGCGC 
CTCCGGTCGT GTCCGTGGAT GTGGGAGCGG GACACCTTGA GGCGACGACC GGACTCCGCG 
391 EAST GTY TLA LWNS A A G LRR 

1261 AACATCAGCC TGGAGCTGGT GGTGAATGTG CCCCCCCAGA TACATGAGAA GGAGGCCTCC 
TTGTAGTCGG ACCTCGACCA CCACTTACAC GGGGGGGTCT ATGTACTCTT CCTCCGGAGG 
411 NISL ELV V N V PPQI HEK EAS 

1321 TCCCCCAGCA TCTACTCGCG TCACAGCCGC CAGGCCCTCA CCTGCACGGC CTACGGGGTG 
AGGGGGTCGT AGATGAGCGC AGTGTCGGCG GTCCGGGAGT GGACGTGCCG GATGCCCCAC 
431SPSI YSR HSR QALT CTA YGV 

1381 CCCCTGCCTC TCAGCATCCA GTGGCACTGG CGGCCCTGGA CACCCTGCAA GATGTTTGCC 
GGGGACGGAG AGTCGTAGGT CACCGTGACC GCCGGGACCT GTGGGACGTT CTACAAACGG 
451PLPL SIQ WHW RPWT PCK MFA 

1441 CAGCGTAGTC TCCGGCGGCG GCAGCAGCAA GACCTCATGC CACAGTGCCG TGACTGGAGG 
GTCGCATCAG AGGCCGCCGC CGTCGTCGTT CTGGAGTACG GTGTCACGGC ACTGACCTCC 
471QRSL RRR QQQ DLMP QCR DWR 
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FIG. 15C 

1501 GCGGTCACCA CGCAGGATGC CGTGAACCCC ATCGAGAGCC TGGACACCTG GACCGAGTTT 
CGCCACTGGT GCGTCCTACG GCACTTGGGG TAGCTCTCGG ACCTGTGGAC CTGGCTCAAA 
491 AVTT QDA VNP IESL DTW TEF 

1561 GTGGAGGGAA AGAATAAGAC TGTGAGCAAG CTGGTGATCC AGAATGCCAA CGTGTCTGCC 
CACCTCCCTT TCTTATTCTG ACACTCGTTC GACCACTAGG TCTTACGGTT GCACAGACGG 
511VEG K NKT V SK L V I Q NAN VSA 

1621 ATGTACAAGT GTGTGGTCTC CAACAAGGTG GGCCAGGATG AGCGGCTCAT CTACTTCTAT 
TACATGTTCA CACACCAGAG GTTGTTCCAC CCGGTCCTAC TCGCCGAGTA GATGAAGATA 
531MYKC V V S NKV GQDE RLI Y F Y 

1681 GTGACCACCA TCCCCGACGG CTTCACCATC GAATCCAAGC CATCCGAGGA GCTACTAGAG 
CACTGGTGGT AGGGGCTGCC GAAGTGGTAG CTTAGGTTCG GTAGGCTCCT CGATGATCTC 
551 VTTI PDG FTI ESKP SEE LLE 

1741 GGCCAGCCGG TGCTCCTGAG CTGCCAAGCC GACAGCTACA AGTACGAGCA TCTGCGCTGG 
CCGGTCGGCC ACGAGGACTC GACGGTTCGG CTGTCGATGT TCATGCTCGT AGACGCGACC 
571GQPV L L S CQA DSYK YEH L R W 

1801 TACCGCCTCA ACCTGTCCAC GCTGCACGAT GCGCACGGGA ACCCGCTTCT GCTCGACTGC 
ATGGCGGAGT TGGACAGGTG CGACGTGCTA CGCGTGCCCT TGGGCGAAGA CGAGCTGACG 
591YRLN LST L H D AHGN PLL L D C 

1861 AAGAACGTGC ATCTGTTCGC CACCCCTCTG GCCGCCAGCC TGGAGGAGGT GGCACCTGGG 
TTCTTGCACG TAGACAAGCG GTGGGGAGAC CGGCGGTCGG ACCTCCTCCA CCGTGGACCC 
611KNVH LFA TPL AASL EEV APG 

1921 GCGCGCCACG CCACGCTCAG CCTGAGTATC CCCCGCGTCG CGCCCGAGCA CGAGGGCCAC 
CGCGCGGTGC GGTGCGAGTC GGACTCATAG GGGGCGCAGC GCGGGCTCGT GCTCCCGGTG 
631 A R H A TLS LSI PRVA PEH EGH 

1981 TATGTGTGCG AAGTGCAAGA CCGGCGCAGC CATGACAAGC ACTGCCACAA GAAGTACCTG 
ATACACACGC TTCACGTTCT GGCCGCGTCG GTACTGTTCG TGACGGTGTT CTTCATGGAC 
651YVCE VQD RRS HDKH CHK KYL 

2041 TCGGTGCAGG CCCTGGAAGC CCCTCGGCTC ACGCAGAACT TGACCGACCT CCTGGTGAAC 
AGCCACGTCC GGGACCTTCG GGGAGCCGAG TGCGTCTTGA ACTGGCTGGA GGACCACTTG 
671 S V Q A LEA PRL TQNL TDL LVN 

2101 GTGAGCGACT CGCTGGAGAT GCAGTGCTTG GTGGCCGGAG CGCACGCGCC CAGCATCGTG 
CACTCGCTGA GCGACCTCTA CGTCACGAAC CACCGGCCTC GCGTGCGCGG GTCGTAGCAC 
691VSDS LEM QCL VAGA HAP S I V 

2161 TGGTACAAAG ACGAGAGGCT GCTGGAGGAA AAGTCTGGAG TCGACTTGGC GGACTCCAAC 
ACCATGTTTC TGCTCTCCGA CGACCTCCTT TTCAGACCTC AGCTGAACCG CCTGAGGTTG 
711 WYKD ERL LEE KSGV DLA DSN 

2221 CAGAAGCTGA GCATCCAGCG CGTGCGCGAG GAGGATGCGG GACGCTATCT GTGCAGCGTG 
GTCTTCGACT CGTAGGTCGC GCACGCGCTC CTCCTACGCC CTGCGATAGA CACGTCGCAC 
731 QKLS IQR VRE EDAG RYL CSV 
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FIG. 15D 

2281 TGCAACGCCA AGGGCTGCGT CAACTCCTCC GCCAGCGTGG CCGTGGAAGG CTCCGAGGAT 
l ACGTTGCGGT TCCCGACGCA GTTGAGGAGG CGGTCGCACC GGCACCTTCC GAGGCTCCTA 

751CNAK GCV NSS ASVA VEG SED 

2341 AAGGGCAGCA TGGAGATCGT GATCCTTGTC GGTACCGGCG TCATCGCTGT CTTCTTCTGG 
TTCCCGTCGT ACCTCTAGCA CTAGGAACAG CCATGGCCGC AGTAGCGACA GAAGAAGACC 
771 KGS M EIV I L V GTGV IAV FFW 

2401 GTCCTCCTCC TCCTCATCTT CTGTAACATG AGGAGGCCGG CCCACGCAGA CATCAAGACG 
CAGGAGGAGG AGGAGTAGAA GACATTGTAC TCCTCCGGCC GGGTGCGTCT GTAGTTCTGC 
791VLLL L I F CNM RRPA HAD IKT 

2461 GGCTACCTGT CCATCATCAT GGACCCCGGG GAGGTGCCTC TGGAGGAGCA ATGCGAATAC 
CCGATGGACA GGTAGTAGTA CCTGGGGCCC CTCCACGGAG ACCTCCTCGT TACGCTTATG 
811 GYLS IIM DPG EVPL EEQ CEY 

2521 CTGTCCTACG ATGCCAGCCA GTGGGAATTC CCCCGAGAGC GGCTGCACCT GGGGAGAGTG 
GACAGGATGC TACGGTCGGT CACCCTTAAG GGGGCTCTCG CCGACGTGGA CCCCTCTCAC 
831LSYD ASQ WEF PRER LHL GRV 

2581 CTCGGCTACG GCGCCTTCGG GAAGGTGGTG GAAGCCTCCG CTTTCGGCAT CCACAAGGGC 
GAGCCGATGC CGCGGAAGCC CTTCCACCAC CTTCGGAGGC GAAAGCCGTA GGTGTTCCCG 
851LGYG AFG KVV EASA FGI HKG 

2641 AGCAGCTGTG ACACCGTGGC CGTGAAAATG CTGAAAGAGG GCGCCACGGC CAGCGAGCAC 
TCGTCGACAC TGTGGCACCG GCACTTTTAC GACTTTCTCC CGCGGTGCCG GTCGCTCGTG 
871SSCD TVA VKM LKEG ATA SEH 

2701 CGCGCGCTGA TGTCGGAGCT CAAGATCCTC ATTCACATCG GCAACCACCT CAACGTGGTC 
GCGCGCGACT ACAGCCTCGA GTTCTAGGAG TAAGTGTAGC CGTTGGTGGA GTTGCACCAG 
891 RALM S E L K I L IHIG NHL NVV 

2761 AACCTCCTCG GGGCGTGCAC CAAGCCGCAG GGCCCCCTCA TGGTGATCGT GGAGTTCTGC 
TTGGAGGAGC CCCGCACGTG GTTCGGCGTC CCGGGGGAGT ACCACTAGCA CCTCAAGACG 
911 NLLG ACT KPQ GPLM VIV EF.C 

2821 AAGTACGGCA ACCTCTCCAA CTTCCTGCGC GCCAAGCGGG ACGCCTTCAG CCCCTGCGCG 
TTCATGCCGT TGGAGAGGTT GAAGGACGCG CGGTTCGCCC TGCGGAAGTC GGGGACGCGC 
931 KYGN LSN FLR AKRD AFS PCA 

2881 GAGAAGTCTC CCGAGCAGCG CGGACGCTTC CGCGCCATGG TGGAGCTCGC CAGGCTGGAT 
CTCTTCAGAG GGCTCGTCGC GCCTGCGAAG GCGCGGTACC ACCTCGAGCG GTCCGACCTA 
951 EKSP EQR GRF RAMV ELA RLD 

2941 CGGAGGCGGC CGGGGAGCAG CGACAGGGTC CTCTTCGCGC GGTTCTCGAA GACCGAGGGC 
GCCTCCGCCG GCCCCTCGTC GCTGTCCCAG GAGAAGCGCG CCAAGAGCTT CTGGCTCCCG 
971RRRP GSS DRV LFAR FSK TEG 

3001 GGAGCGAGGC GGGCTTCTCC AGACCAAGAA GCTGAGGACC TGTGGCTGAG CCCGCTGACC 
CCTCGCTCCG CCCGAAGAGG TCTGGTTCTT CGACTCCTGG ACACCGACTC GGGCGACTGG 
991 GARR ASP DQE AEDL WLS PLT 
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FIG. 15E 

3061 ATGGAAGATC TTGTCTGCTA CAGCTTCCAG GTGGCCA6AG GGATGGAGTT CCTGGCTTCC 
TACCTTCTAG AACAGACGAT GTCGAAGGTC CACCGGTCTC CCTACCTCAA GGACCGAAGG 
1011 MEDL VCY SFQ VARG MEF LAS 

3121 CGAAAGTGCA TCCACAGAGA CCTGGCTGCT CGGAACATTC TGCTGTCGGA AAGCGACGTG 
GCTTTCACGT AGGTGTCTCT GGACCGACGA GCCTTCTAAG ACGACAGCCT TTCGCTGCAC 
1031 RKC I HRD LAA RNIL L S E SDV 

3181 GTGAAGATCT GTGACTTTGG CCTTGCCCGG GACATCTACA AAGACCCTGA CTACGTCCGC 
CACTTCTAGA CACTGAAACC GGAACGGGCC CTGTAGATGT TTCTGGGACT GATGCAGGCG 
1051 VKIC DFG L A R DIYK DPD Y V R 

3241 AAGGGCAGTG CCCGGCTGCC CCTGAAGTGG ATGGCCCCTG AAAGCATCTT CGACAAGGTG 
TTCCCGTCAC GGGCCGACGG GGACTTCACC TACCGGGGAC TTTCGTAGAA GCTGTTCCAC 
1071 KGSA RLP LKW MAPE SIF DKV 

3301 TACACCACGC AGAGTGACGT GTGGTCCTTT GGGGTCCTTC TCTGGGAGAT CTTCTCTCTC 
ATGTGGTGCG TCTCACTGCA CACCAGGAAA CCCCACGAAG AGACCCTCTA GAAGAGAGAC 
1091 YTTQ SDV WSF GVLL WEI FSL 

3361 GGGGCCTCCC CGTACCCTGG GGTGCAGATC AATGAGGAGT TCTGCCAGCG GCTGAGAGAC 
CCCCGGAGGG GCATGGGACC CCACGTCTAG TTACTCCTCA AGACGGTCGC CGACTCTCTG 
1111 GASP YPG VQI NEEF CQR LRD 

3421 GGCACAAGGA TGAGGGCCCC GGAGCTGGCC ACTCCCGCCA TACGCCGCAT CATGCTGAAC 
CCGTGTTCCT ACTCCCGGGG CCTCGACCGG TGAGGGCGGT ATGCGGCGTA GTACGACTTG 
1131 GTRM RAP ELA TPAI RRI MLN 

3481 TGCTGGTCCG GAGACCCCAA GGCGAGACCT GCATTCTCGG AGCTGGTGGA GATCCTGGGG 
ACGACCAGGC CTCTGGGGTT CCGCTCTGGA CGTAAGAGCC TCGACCACCT CTAGGACCCC 
1151 CWSG DPK ARP AFSE LVE ILG 

3541 GACCTGCTCC AGGGCAGGGG CCTGCAAGAG GAAGAGGAGG TCTGCATGGC CCCGCGCAGC 
. CTGGACGAGG TCCCGTCCCC GGACGTTCTC CTTCTCCTCC AGACGTACCG GGGCGCGTCG 
1171 DLLQ GRG LQE EEEV CMA PRS 

3601 TCTCAGAGCT CAGAAGAGGG CAGCTTCTCG CAGGTGTCCA CCATGGCCCT ACACATCGCC 
AGAGTCTCGA GTCTTCTCCC GTCGAAGAGC GTCCACAGGT GGTACCGGGA TGTGTAGCGG 
1191 SQSS EEG SFS QVST MAL HIA 

3661 CAGGCTGACG CTGAGGACAG CCCGCCAAGC CTGCAGCGCC ACAGCCTGGC CGCCAGGTAT 
GTCCGACTGC GACTCCTGTC GGGCGGTTCG GACGTCGCGG TGTCGGACCG GCGGTCCATA 
1211 QADA EDS PPS LQRH SLA ARY 

3721 TACAACTGGG TGTCCTTTCC CGGGTGCCTG GCCAGAGGGG CTGAGACCCG TGGTTCCTCC 
ATGTTGACCC ACAGGAAAGG GCCCACGGAC CGGTCTCCCC GACTCTGGGC ACCAAGGAGG 
1231 YNWV SFP GCL ARGA ETR GSS 

3781 AGGATGAAGA CATTTGAGGA ATTCCCCATG ACCCCAACGA CCTACAAAGG CTCTGTGGAC 
TCCTACTTCT GTAAACTCCT TAAGGGGTAC TGGGGTTGCT GGATGTTTCC GAGACACCTG 
1251 RMKT FEE FPM TPTT YKG SVD 
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FIG. 15F 



3841 AACCAGACAG ACAGTGGGAT GGTGCTGGCC TCGGAGGAGT TTGAGCAGAT AGAGAGCAGG 
TTGGTCTGTC TGTCACCCTA CCACGACCGG AGCCTCCTCA AACTCGTCTA TCTCTCGTCC 
1271 NQTD SGM V L A SEEF EQI ESR 

3901 CATAGACAAG AAAGCGGCTT CAGGTAGCTG AAGCAGAGAG AGAGAAGGCA GCATACGTCA 
GTATCTGTTC TTTCGCCGAA GTCCATCGAC TTCGTCTCTC TCTCTTCCGT CGTATGCAGT 
1291 HRQESGFRO 

3961 GCATTTTCTT CTCTGCACTT ATAAGAAAGA TCAAAGACTT TAAGACTTTC GCTATTTCTT 
CGTAAAAGAA GAGACGTGAA TATTCTTTCT AGTTTCTGAA ATTCTGAAAG CGATAAAGAA 

4021 CTCCTATCTA CTACAAACTT CAAAGAGGAA CCAGGAGGCC AAGAGGAGCA TGAAAGTGGA 
GACGATAGAT GATGTTTGAA GTTTCTCCTT GGTCCTCCGG TTCTCCTCGT ACTTTCACCT 

4081 CAAGGAGTGT GACCACTGAA GCACCACAGG GAGGGGTTAG GCCTCCGGAT GACTGCGGGC 
GTTCCTCACA CTGGTGACTT CGTGGTGTCC CTCCCCAATC CGGAGGCCTA CTGACGCCCG 

4141 AGGCCTGGAT AATATCCAGC CTCCCACAAG AAGCTGGTGG AGCAGAGTGT TCCCTGACTC 
TCCGGACCTA TTATAGGTCG GAGGGTGTTC TTCGACCACC TCGTCTCACA AGGGACTGAG 

4201 CTCCAAGGAA AGGGAGACGC CCTTTCATGG TCTGCTGAGT AACAGGTGCC TTCCCAGACA 
GAGGTTCCTT TCCCTCTGCG GGAAAGTACC AGACGACTCA TTGTCCACGG AAGGGTCTGT 

4261 CTGGCGTTAC TGCTTGACCA AAGAGCCCTC AAGCGGCCCT TATGCCAGCG TGACAGAGGG 
GACCGCAATG ACGAACTGGT TTCTCGGGAG TTCGCCGGGA ATACGGTCGC ACTGTCTCCC 

4321 CTCACCTCTT GCCTTCTAGG TCACTTCTCA CAATGTCCCT TCAGCACCTG ACCCTCTCCC 
GAGTGGAGAA CGGAAGATCC AGTGAAGAGT GTTACAGGGA AGTCGTGGAC TGGGACACGG 

4381 CGCCAGTTAT TCCTTGGTAA TATGAGTAAT ACATCAAAGA GTAGT 
GCGGTCAATA AGGAACCATT ATACTCATTA TGTAGTTTCT CATCA 
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FIG. 16A 

1 ATGGCTGGGA TTTTCTATTT CGCCCTATTT TCGTGTCTCT TCGGGATTTC 
TAQCGACCCT AAAAGATAAA GCGGGATAAA AGCACAGAGA AGCCCTAAAC 

1 MetAlaGlyl lePheTyrPh eAlaLeuPhe SerCysLeuP heGlylleCy 
CGACGCTGTC ACAGGTTCCA GGGTATACCC CGCGAATGAA GTTACCTTAT 
GCTGCGACAG TGTCCAAGGT CCCATATGGG GCGCTTACTT CAATGGAATA 
sAspAlaVal ThrGlySerA rgValTyrPr oAlaAsnGlu ValThrLeuLeu 

101 TGGATTCCAG ATCTGTTCAG GGAGAACTTG GGTGGATAGC AAGCCCTCTG 
ACCTAAGGTC TAGACAAGTC CCTCTTGAAC CCACCTATCG TTCGGGAGAC 
35 AspSerAr gSerValGln GlyGluLeuG lyTrpIleAl aSerProLeu 
GAAGGAGGGT GGGAGGAAGT GAGTATCATG GATGAAAAAA ATACACCAAT 
CTTCCTCCCA CCCTCCTTCA CTCATAGTAC CTACTTTTTT TATGTGGTTA 
GluGlyGlyT rpGluGluVa iSerll^Met AspGluLysA saThrProIle 

201 CCGAACCTAC CAAGTGTGCA ATGTGATGGA ACCCAGCCAG AATAACTGGC 
GGCTTGGATG GTTCACACGT TACACTACCT TGGGTCGGTC TTATTGACCG 
68 ArgThrTyr GlnValCysA snValMetGl uProSerGln AsnAsnTrpL 
TACGAACTGA TTGGATCACC CGAGAAGGGG CTCAGAGGGT GTATATTGAG 
ATGCTTGACT AACCTAGTGG GCTCTTCCCC GAGTCTCCCA CATATAACTC 
euArgThrAs pTrpIleThr ArgGluGlyA laGlnArgVa lTyrlleGlu 

301 ATTAAATTCA CCTTGAGGGA CTGCAATAGT CTTCCGGGCG TCATGGGGAC 
TAATTTAAGT GGAACTCCCT GACGTTATCA GAAGGCCCGC AGTACCCCTG 

101 IleLysPheT hrLeuArgAs pCysAsnSer LeuProGlyV alMetGlyTh 
TTGCAAGGAG ACGTTTAACC TGTACTACTA TGAATCAGAC AACGACAAAG 
AACGTTCCTC TGCAAATTGG ACATGATGAT ACTTAGTCTG TTGCTGTTTC 
rCysLysGlu ThrPheAsnL euTyrTyrTy rGluSerAsp AsnAspLysGlu 
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FIG. 16B 

401 AGCGTTTCAT CAGAGAGAAC CAGTTTGTCA AAATTGACAC CATTGCTGCT 
TCGCAAAGTA GTCTCTCTTG GTCAAACAGT TTTAACTGTG GTAACGACGA 
135 ArgPhell eArgGluAsn GlnPheValL ysIleAspTh rlleAlaAla 
GATGAGAGCT TCACCCAAGT GGACATTGGT GACAGAATCA TGAAGCTGAA 
CTACTCTCGA AGTGGGTTCA CCTGTAACCA CTGTCTTAGT ACTTCGACTT 
AspGluSerP heThrGlnVa lAspIleGly AspArglleM etLysLeuAsn 

501 CACCGAGATC CGGGATGTAG GGCCATTAAG CAAAAAGGGG TTTTACCTGG 
GTGGCTCTAG GCCCTACATC CCGGTAATTC GTTTTTCCCC AAAATGGACC 

168 ThrG luIle ArgAspValG lyProLeuSe rLysLysGly PheTyrLeuA 
CTTTTCAGGA TCTGGGGGCC TGCATCGCCC TGGTATCAGT CCGTGTGTTC 
GAAAAGTCCT ACACCCCCGG ACGTAGCGGG ACCATAGTCA GGCACACAAG 
laPheGlnAs pValGlyAla CysIleAlaL euValSerVa lArgValPhe 

601 TATAAAAAGT GTCCACTCAC AGTCCGCAAT CTGGCCCAGT TTCCTGACAC 
ATATTTTTCA CAGGTGAGTG TCAGGCGTTA GACCGGGTCA AAGGACTGTG 

201 TyrLysLysC ysProLeuTh rValArgAsn LeuAlaGlnP heProAspTh 
CATCACAGGG GCTGATACGT CTTCCCTGGT GGAAGTTCGA GGCTCCTGTG 
GTAGTGTCCC CGACTATGCA GAAGGGACCA CCTTCAAGCT CCGAGGACAC 
rlleThrGly AlaAspThrS erSerLeuVa lGluValArg GlySerCysVal 

701 TCAACAACTC AGAAGAGAAA GATGTGCCAA AAATGTACTG TGGGGCAGAT 
AGTTGTTGAG TCTTCTCTTT CTACACGGTT TTTACATGAC ACCCCGTCTA 

235 AsnAsnSe rGluGluLys AspValProL ysMetTyrCy sGlyAlaAsp 
GGTGAATGGC TGGTACCCAT TGGCAACTGC CTATGCAACG CTGGGCATGA 
(X^CTTACCG ACCATGGGTA ACCGTTGACG GATACGTTGC GACCCGTACT 
GlyGluTrpL euValProIl eGlyAsnCys LeuCysAsnA laGlyHisGlu 

801 GGAGCGGAGC GGAGAATGCC AAGCTTGCAA AATTGGATAT TACAAGGCTC 
CCTCGCCTCG CCTCTTACGG TTCGAACGTT TTAACCTATA ATGTTCCGAG 

268 GluArgSer GlyGluCysG InAlaCysLy sIleGlyTyr TyrLysAlaL 
TCTCCACGGA TGCCACCTGT GCCAAGTGCC CACCCCACAG CTACTCTGTC 
AGAGGTGCCT ACGGTGGACA CGGTTCACGG GTGGGGTGTC GATGAGACAG 
euSerThrAs pAlaThrCys AlaLysCysP roProHisSe rTyrSerVal 
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FIG. 16C 

901 TGGGAAGGAG CCACCTCGTG CACCTGTGAC CGAGGCTTTT TCAGAGCTGA 
ACCCTTCCTC GGTGGAGCAC GTGGACACTG GCTCCGAAAA AGTCTCGACT 
301 TrpGluGlyA laThrSerCy sThrCysAsp ArgGlyPheP heArgAlaAs 
CAACGATGCT GCCTCTATGC CCTGCACCCG TCCACCATCT GCTCCCCTGA 
GTTGCTACGA CGGAGATACG GGACGTGGGC AGGTGGTAGA CGAGGGGACT 
pAsnAspAla AlaSerMetP roCysThrAr gProProSer AlaProLeuAsn 
1001 ACTTGATTTC AAATGTCAAC GAGACATCTG TGAACTTGGA ATGGAGTAGC 
TGAACTAAAG TTTACAGTTG CTCTGTAGAC ACTTGAACCT TACCTCATCG 
335 LeuIleSe rAsnValAsn GluThrSerV alAsnLeuGl uTrpSerSer 
CCTCAGAATA CAGGTGGCCG CCAGGACATT TCCTATAATG TGGTATGCAA 
GGAGTCTTAT GTCCACCGGC GGTCCTGTAA AGGATATTAC ACCATACCTT 
ProOlnAsnT hrGlyGlyAr gGlnAspIle SerTyrAsnV alValCysLys 
1101 GAAATGTGGA GCTGGTGACC CCAGCAAGTG CCGACCCTGT GGAAGTGGGG 
CTTTACACCT CGACCACTGG GGTCGTTCAC GGCTGGGACA CCTTCACCCC 
368 LysCysGly AlaGlyAspP roSerLysCy sArgProCys GlySerGlyV 
TCCACTACAC CCCACAGCAG AATGGCTTGA AGACCACCAA AGGCTCCATC 
AGGTGATGTG GGGTGTCGTC TTACCGAACT TCTGGTGGTT TCCGAGGTAG 
alHisTyrTh rProGlnGln AsnGlyLeuL ysThrThrLy sGlySerlle 
1201 ACTGACCTCC TAGCTCATAC CAATTACACC TTTGAAATCT GGGCTGTGAA 
TGACTGGAGG ATCGAGTATG GTTAATGTGG AAACTTTAGA CCCGACACTT 
401 ThrAspLeuL euAlaHisTh rAsnTyrThr PheGluIleT rpAlaValAs 
TGGAGTGTCC AAATATAACC CTAACCCAGA CCAATCAGTT TCTGTCACTG 
ACCTCACAGG TTTATATTGG GATTGGGTCT GGTTAGTCAA AGACAGTGAC 
nGlyValSer LysTyrAsnP roAsnProAs pGlnSerVal SerValThrVal 
1301 TGACCACCAA CCAAGCAGCA CCATCATCCA TTGCTTTGGT CCAGGCTAAA 
ACTGGTGGTT GGTTCGTCGT GGTAGTAGGT AACGAAACCA GGTCCGATTT 
435 ThrThrAs nGlnAlaAla ProSerSerl leAlaLeuVa IGlnAlaLys 
GAAGTCACAA GATACAGTGT GGCACTGGCT TGGCTGGAAC CAGATCGGCC 
CTTCAGTGTT CTATGTCACA CCGTGACCGA ACCGACCTTG GTCTAGCCGG 
GluValThrA rgTyrSerVa lAlaLeuAla TrpLeuGluP roAspArgPro 
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FIG. 16D 

1401 CAATGGGOTA ATCCTGGAAT ATGAAGTCAA CTATTATGAG AAGGATCAGA 
GTTACCCCAT TAGGACCTTA TACTTCAGTT CATAATACTC TTCCTAGTCT 
468 AsnGlyVal IleLeuGluT yrGluValLy sTyrTyrGlu LysAspGlnA 
ATGAGGGAAG CTATCGTATA GTTCGGACAG CTGCCAGGAA CACAGATATC 
TACTCGCTTC GATAGCATAT CAAGCCTGTC GACGGTCCTT GTGTCTATAG 
snGluArgSe rTyrArglle ValArgThrA laAlaArgAs nThrAspIle 
1501 AAAGGCCTGA ACCCTCTCAC TTCCTATGTT TTCCACGTGC GAGCCAGGAC 
TTTCCGGACT TGGGAGAGTG AAGGATACAA AAGGTGCACG CTCGGTCCTG 
501 LysGlyLeuA snProLeuTh rSerTyrVal PheHisValA rgAlaArgTh 
AGCAGCTGGC TATGGAGACT TCAGTGAGCC CTTGGAGGTT ACAACCAACA 
TCGTCGACCG ATACCTCTGA AGTCACTCGG GAACCTCCAA TGTTGGTTGT 
rAlaAlaGly TyrGlyAspP heSerGluPr oLeuGluVal ThrThrAsnThr 
1601 CAGTGCCTTC CCGGATCATT GGAGATGGGG CTAACTCCAC AGTCCTTCTG 
GTCACGGAAG GGCCTAGTAA CCTCTACCCC GATTGAGGTG TCAGGAAGAC 
535 ValProSe rArgllelle GlyAspGlyA laAsnSerTh zValLeuLeu 
GTCTCTGTCT CGGGCAGTGT GGTGCTGGTG GTAATTCTCA TTGCAGCTTT 
CAGAGACAGA GCCCGTCACA CCACGACCAC CATTAAGAGT AACGTCGAAA 
ValSerValS erGlySerVa IValLeuVal VallleLeuI leAlaAlaPhe 
1701 TGTCATCAGC CGGAGACGGA GTAAATACAG TAAAGCCAAA CAAGAAGCGG 
ACAGTAGTCG GCCTCTGCCT CATTTATGTC ATTTCGGTTT GTTCTTCGCC 
568 VallleSer ArgArgArgS erLysTyrSe rLysAlaLys GlnGluAlaA 
ATGAAGAGAA ACATTTGAAT CAAGGTGTAA GAACATATGT GGACCCCTTT 
TACTTCTCTT TGTAAACTTA GTTCCACATT CTTGTATACA CCTGGGGAAA 
spGluGluLy sHisLeuAsn GlnGlyValA rgThrTyrVa lAspProPhe 
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FIG. 16E 

1801 ACGTACGAAG ATCCCAACCA AGCAGTGCGA GAGTTTGCCA AAGAAATTGA 
TGCATGCTTC TAGGGTTGGT TCGTCACGCT CTCAAACGGT TT CT T TA ACT 
601 ThrTyxGluA soProAsnGl nAlaValAra GluPheAlaL ysGluIleAs 
CGCATCCTGC ATTAAGATTG AAAAAGTTAT AGGAGTTGGT GAATTTGGTG 
GCGTAGGACG TAATTCTAAC TTTTTCAATA TCCTCAACCA CTTAAACCAC 
pAlaSerCys IleLysIleG luLysValll eGlyValGly GluPheGlyGlu 
1901 AGGTATGCAG TGGGCGTCTC AAAGTGCCTG GCAAGAGAGA GATCTGTGTG 
TCCATACGTC ACCCGCAGAG TTTCACGGAC CGTTCTCTCT CTAGACACAC 
635 ValCysSe rGlyArgLeu LysValProG lyLysArgGl ulleCysVal 
GCTATCAAGA CTCTGAAAGC TGGTTATACA GACAAACAGA GGAGAGACTT 
CGATAGTTCT GAGACTTTCG ACCAATATGT CTGTTTGTCT CCTCTCTGAA 
AlalleLysT hrLeuLysAl aGlyTyrThr AspLysGlnA rgArgAspPhe 
2001 CCTGAGTGAG GCCAGCATCA TGGGACAGTT TGACCATCCG AACATCATTC 
GGACTCACTC CGGTCGTAGT ACCCTGTCAA ACTGGTAGGC TTGTAGTAAG 
668 LeuSerGlu AlaSerlleM etGlyGlnPh eAspHisPro AsnllelleH 
ACTTGGAAGG CGTGGTCACT AAATGTAAAC CAGTAATGAT CATAACAGAG 
TGAACCTTCC GCACCAGTGA TTTACATTTG GTCATTACTA GTATTGTCTC 
isLeuGluGl yValValThr LysCysLysP roValMetll elleThrGlu 
2101 TACATGGAGA ATGGCTCCTT GGATGCATTC CTCAGGAAAA ATGATGGCAG 
ATGTACCTCT TACCGAGGAA CCTACGTAAG GAGTCCTTTT TACTACCGTC 
701 TyrMetGluA snGlySerLe uAspAlaPhe LeuArgLysA snAspGlyAr 
ATTTACAGTC ATTCAGCTGG TGGGCATGCT TCGTGGCATT GGGTCTGGGA 
TAAATGTCAG TAAGTCGACC ACCCGTACGA AGCACCGTAA CCCAGACCCT 
gPheThrVal IleGlnLeuV alGlyMetLe uArgGlylle GlySerGlyMet 
2201 TGAAGTATTT ATCTGATATG AGCTATGTGC ATCGTGATCT GGCCGCACGG 
ACTTCATAAA TAGACTATAC TCGATACACG TAGCACTAGA CCGGCGTGCC 
735 LysTyrLe uSerAspMet SerTyrValH isArgAspLe uAlaAlaArg 
AACATCCTGG TGAACAGCAA CTTGGTCTGC AAAGTGTCTG ATTTTGGCAT 
TTGTAGGACC ACTTGTCGTT GAACCAGACG TTTCACAGAC TAAAACCGTA 
AsnlleLeuV alAsnSerAs nLeuValCys LysValSerA spPheGlyMet 
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FIG. 16F 

2301 GTCCCGAGTG CTTGAGGATG ATCCGGAAGC AGCTTACACC ACCAGGGGTG 
CAGGGCTCAC GAACTCCTAC TAGGCCTTCG TCGAATGTGG TGGTCCCCAC 
768 SerArgVal LeuGluAspA spProGluAl aAlaTyrThr ThrArgGlyG 
GCAAGATTCC TATCCGGTGG ACTGCGCCAG AAGCAATTGC CTATCGTAAA 
CGTTCTAAGG ATAGGCCACC TGACGCGGTC TTCGTTAACG GATAGCATTT 
lyLysIlePr oIleArgTrp ThrAlaProG luAlalleAl aTyrArgLys 

2401 TTCACATCAG CAAGTGATGT ATGGAGCTAT GGAATCGTTA TGTGGGAAGT 
AAGTGTAGTC GTTCACTACA TACCTCGATA CCTTAGCAAT ACACCCTTCA 
801 PheThrSerA laSerAspVa ITrpSerTyr GlylleValM etTrjjGluVa 
GATGTCGTAC GGGGAGAGGC CCTATTGGGA TATGTCCAAT CAAGATGTGA 
CTACAGCATG CCCCTCTCCG GGATAACCCT ATACAGGTTA GTTCTACACT 
lMetSerTyr GlyGluArgP roTyrTrpAs pMetSerAsn GlnAspVallle 

2501 TTAAAGCCAT TGAGGAAGGC TATCGGTTAC CCCCTCCAAT GGACTGCCCC 
AATTTCGGTA ACTCCTTCCG ATAGCCAATG GGGGAGGTTA CCTGACGGGG 
835 LysAlall eGluGluGly TyrArgLeuP roProProMe tAspCysPro 
ATTGCGCTCC ACCAGCTGAT GCTAGACTGC TGGCAGAAGG AGAGGAGCGA 
TAACGCGAGG TGGTCGACTA CGATCTGACG ACCGTCTTCC TCTCCTCGCT 
IleAlaLeuH isGlnLeuMe tLeuAspCys TrpGlnLysG luArgSerAsp 

2601 CAGGCCTAAA TTTGGGCAGA TTGTCAACAT GTTGGACAAA CTCATCCGCA 
GTCCGGATTT AAACCCGTCT AACAGTTGTA CAACCTGTTT GAGTAGGCGT 
868 ArgProLys PheGlyGlnl leValAsnMe tLeuAspLys LeuIleArgA 
ACCCCAACAG CTTGAAGAGG ACAGGGACGG AGAGCTCCAG ACCTAACACT 
TGGGGTTGTC GAACTTCTCC TGTCCCTGCC TCTCGAGGTC TGGATTGTGA 
snProAsnSe rLeuLysArg ThrGlyThrG luSerSerAr gProAsnThr 
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FIG. 16G 



2701 GCCTTGTTGG ATCCAAGCTC CCCTGAATTC TCTGCTGTGG TATCAGTGGG 
CGGAACAACC TAGGTTCGAG GGGACTTAAG AGACGACACC ATAGTCACCC 
901 AlaLeuLeuA spProSerSe rProGluPhe SerAlaValV alSerValGl 
CGATTGGCTC CAGGCCATTA AAATGGACCG GTATAAGGAT AACTTCACAG 
GCTAACCGAG GTCCGGTAAT TTTACCTGGC CATATTCCTA TTGAAGTGTC 
yAspTrpLeu GlnAlalleL ysMetAspAr gTyrLysAsp AsnPheThrAla 
2801 CTGCTGGTTA TACCACACTA GAGGCTGTGG TGCACGTGAA CCAGGAGGAC 
GACGACCAAT ATGGTGTGAT CTCCGACACC ACGTGCACTT GGTCCTCCTG 
935 AlaGlyTy rThrThrLeu GluAlaValV alHisValAs nGlnGluAsp 
CTGGCAAGAA TTGGTATCAC AGCCATCACA CACCAGAATA AGATTTTGAG 
GACCGTTCTT AACCATAGTG TCGGTAGTGT GTGGTCTTAT TCTAAAACTC 
LeuAlaArgI leGlylleTh rAlalleThr HisGlnAsnL ysIleLeuSer 
2901 CAGTGTCCAG GCAATGCGAA CCCAAATGCA GCAGATGCAC GGCAGAATGG 
GTCACAGGTC CGTTACGCTT GGGTTTACGT CGTCTACGTG CCGTCTTACC 
968 SerValGln AlaMetArgT hrGlnMetGl nGlnMetHis GlyArgMetV 
TTCCCGTCTG AGCCAGTACT GAATAAACTC AAAACTCTTG AAATTAGTTT 
AAGGGCAGAC TCGGTCATGA CTTATTTGAG TTTTGAGAAC TTTAATCAAA 
alProValpp *AlaSerThr GluOc*ThrG InAsnSerOp *AsnAm*Phe 
3001 ACCTCATCCA TGCACTTTAA TTGAAGAACT GCACTTTTTT TACTTCGTCT 
TGGAGTAGGT ACGTGAAATT AACTTCTTGA CGTGAAAAAA ATGAAGCAGA 
1001 ThrSerSerM etHisPheAs nOp*ArgThr AlaLeuPheL euLeuArgLe 
TCGCCCTCTG AAATTAAAGA AATGAAAAAA AAAAAACAAT ATCTGCAGCG 
AGCGGGAGAC TTTAATTTCT TTACTTTTTT TTTTTTGTTA TAGACGTCGC 
uArgProLeu LysLeuLysL ysQp*LysLy sLysAsnAsn IleCysSerVal 
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FIG. 16H 

3101 TTGCTTGGTG CACAGATTGC TGAAACTGTG GGGCTTACAG AAATGACTGC 
AACGAACCAC GTGTCTAACG ACTTTGACAC CCCGAATGTC TTTACTGACG 

1035 AlaTroCv sThrAsoCvs Oo*AsnCvsG lvAlaTvrAr aAsnAsDCvs 
CGGTCATTTG AATGAGACCT GGAACAAATC GTTTCTCAGA AGTACTTTTC 
GCCAGTAAAC TTACTCTGGA CCTTGTTTAG CAAAGAGTCT TCATGAAAAG 
AroSerPheG luOo*AsDLe uGluGlnlle ValSerGlnL ysTyrPheSer 

3201 TGTTCATCAC CACTCTGTAA AATACATGTA CCTATAGAAA TAGAACACTG 
ACAAGTAGTG GTCAGACATT TTATGTACAT GGATATCTTT ATCTTGTGAC 

1068 ValHisHis GlnSerValL ysTyrMetTy rLeuAm*Lys Am*AsnThrA 
CCTCTGAGTT TTGATGCTGT ATTTGCTGCC AGACACTGAG CTTCTGAGAC 
GGAGACTCAA AACTACGACA TAAACGACGG TCTGTGACTC GAAGACTCTG 
laSerGluPh eOp*CysCys IleCysCysG InThrLeuSe rPheOp*Asp 

3301 ATCCCTGATT CTCTCTCCAT TTGGAATTAC AACGGTCGAC GAGCTCGA 
TAGGGACTAA GAGAGAGGTA AACCTTAATG TTGCCAGCTG CTCGAGCT 

1101 IleProAspS erLeuSerll eTrpAsnTyr AsnGlyArgA rgAlaArg 
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